AI Repos Worth Watching
GitHub's trending AI repos ranked and scored. Updated daily.
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
An agentic skills framework & software development methodology that works.
Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini CLI, and more.
Taste-Skill - gives your AI good taste. stops the AI from generating boring, generic slop
Production-grade engineering skills for AI coding agents.
PM Skills Marketplace: 100+ agentic skills, commands, and plugins — from discovery to strategy, execution, launch, and growth.
Open source repository of plugins primarily intended for knowledge workers to use in Claude Cowork
Official Compound Engineering plugin for Claude Code, Codex, Cursor, and more
A skill file for removing AI tells from prose
OpenAI Plugins
754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Copilot, Codex CLI, Cursor, Gemini CLI & 20+ platforms · 26 security domains · Apache 2.0
A tool for creating and running Linux containers using lightweight virtual machines on a Mac. It is written in Swift, and optimized for Apple silicon.
The agent that grows with you
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
Hermes WebUI: The best way to use Hermes Agent from the web or from your phone!
Learn it. Build it. Ship it for others.
open-source healthcare ai
Find vulnerabilities, misconfigurations, secrets, SBOM in containers, Kubernetes, code repositories, clouds and more
Memory engine and app that is extremely fast, scalable. The Memory API for the AI era.
The open alternative to Salesforce, designed for AI.
A meta-skill that designs domain-specific agent teams, defines specialized agents, and generates the skills they use.
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI, VSCode Agent, Warp.dev, Windsurf, Xcode, Z.ai Code, Dia & v0. (And other Open Sourced) System Prompts, Internal Tools & AI Models
DigitalPlat FreeDomain: Free Domain For Everyone
Drop-in JSON replacement for all AI pipelines. 79% fewer tokens. JSON scores 53.6% comprehension at scale, GCF scores 90.5%. Superpowers for graph-shaped data.
GCF is a new compact JSON replacement format designed to improve token efficiency and comprehension in AI pipelines, especially for graph-shaped data.
Auto-updated index of MCP servers shipping on GitHub, refreshed every 15 minutes
An auto-updated GitHub index listing active MCP servers related to AI models and tooling is refreshed every 15 minutes.
🛠️ Run and experiment with Claude Code safely in an isolated Docker sandbox, protecting your files and projects from unwanted changes.
A Docker sandbox environment has been released to safely run and experiment with Claude Code, isolating files and projects from unwanted changes.
Build a multi-collection RAG system using LlamaIndex and Qdrant in a Docker environment.
A Docker-based multi-collection retrieval-augmented generation system combining LlamaIndex and Qdrant is provided for scalable RAG deployments.
Production ready toolkit to run AI locally
RunanywhereAI has released a production-ready C++ toolkit designed to run AI models locally across multiple platforms including mobile and web.
A Next-Generation Training Engine Built for Ultra-Large MoE Models
InternLM/xtuner is a new training engine optimized for ultra-large Mixture-of-Experts (MoE) models to enhance their training efficiency and scalability.
🚀 Build a fast inference engine for the QWEN3-0.6B model using CUDA, optimizing performance with minimal dependencies for efficient learning and practice.
A CUDA-based fast inference engine was developed for the QWEN3-0.6B model focusing on performance optimization with minimal dependencies.
💥 Optimize linear attention models with efficient Triton-based implementations in PyTorch, compatible across NVIDIA, AMD, and Intel platforms.
The simboco/flash-linear-attention repository provides Triton-based PyTorch implementations to optimize linear attention models efficiently across multiple hardware platforms.
Experimenting with Pinecone as vector data continues to take center stage in AI-native systems. The purpose of this project is to explore the core capabilities, and better understand what is possible with Pinecone.
This project experiments with Pinecone, a vector database widely used in AI-native systems, to explore its core capabilities and potential applications.
🚀 Simplify API management with Ollama API Pool, a robust solution for seamless integration and optimized performance in your projects.
Ollama API Pool is a JavaScript-based tool designed to simplify management and load balancing of LLM APIs like Ollama and OpenAI in edge and serverless environments.