AI Repos Worth Watching
GitHub's trending AI repos ranked and scored. Updated daily.
Pre-indexed code knowledge graph, auto syncs on code changes, for Claude Code, Codex, Gemini, Cursor, OpenCode, AntiGravity, Kiro, and Hermes Agent — fewer tokens, fewer tool calls, 100% local
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini CLI, and more.
Generate high-definition short videos with one click using large AI models.
Your personal AI superintelligence. Private, simple, and extremely powerful.
Learn it. Build it. Ship it for others.
Academic Research Skills for Claude Code: research → write → review → revise → finalize
⌥ AI Coding agent for the terminal — hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more
A skill file for removing AI tells from prose
#1 Persistent memory for AI coding agents based on real-world benchmarks
A tool for creating and running Linux containers using lightweight virtual machines on a Mac. It is written in Swift, and optimized for Apple silicon.
"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"
Open source repository of plugins primarily intended for knowledge workers to use in Claude Cowork
Taste-Skill - gives your AI good taste. Stops the AI from generating boring, generic slop.
TencentDB Agent Memory delivers fully local long-term memory for AI Agents via a 4-tier progressive pipeline, with zero external API dependencies.
Skills for Real Engineers. Straight from my .claude directory.
Drop-in JSON replacement for all AI pipelines. 79% fewer tokens. JSON scores 53.6% comprehension at scale, GCF scores 90.5%. Superpowers for graph-shaped data.
GCF is a new compact JSON replacement format designed to improve token efficiency and comprehension in AI pipelines, especially for graph-shaped data.
Auto-updated index of MCP servers shipping on GitHub, refreshed every 15 minutes
An auto-updated GitHub index listing active MCP servers related to AI models and tooling is refreshed every 15 minutes.
🛠️ Run and experiment with Claude Code safely in an isolated Docker sandbox, protecting your files and projects from unwanted changes.
A Docker sandbox environment has been released to safely run and experiment with Claude Code, isolating files and projects from unwanted changes.
Build a multi-collection RAG system using LlamaIndex and Qdrant in a Docker environment.
A Docker-based multi-collection retrieval-augmented generation system combining LlamaIndex and Qdrant is provided for scalable RAG deployments.
Production ready toolkit to run AI locally
RunanywhereAI has released a production-ready C++ toolkit designed to run AI models locally across multiple platforms including mobile and web.
A Next-Generation Training Engine Built for Ultra-Large MoE Models
InternLM/xtuner is a new training engine optimized for ultra-large Mixture-of-Experts (MoE) models to enhance their training efficiency and scalability.
🚀 Build a fast inference engine for the QWEN3-0.6B model using CUDA, optimizing performance with minimal dependencies for efficient learning and practice.
A CUDA-based fast inference engine was developed for the QWEN3-0.6B model focusing on performance optimization with minimal dependencies.
💥 Optimize linear attention models with efficient Triton-based implementations in PyTorch, compatible across NVIDIA, AMD, and Intel platforms.
The simboco/flash-linear-attention repository provides Triton-based PyTorch implementations to optimize linear attention models efficiently across multiple hardware platforms.
Experimenting with Pinecone, as vector data continues to take center stage in AI-native systems. The purpose of this project is to explore Pinecone's core capabilities and better understand what is possible with it.
This project experiments with Pinecone, a vector database widely used in AI-native systems, to explore its core capabilities and potential applications.
🚀 Simplify API management with Ollama API Pool, a robust solution for seamless integration and optimized performance in your projects.
Ollama API Pool is a JavaScript-based tool designed to simplify management and load balancing of LLM APIs like Ollama and OpenAI in edge and serverless environments.