LLMsMedium impactFor DevarXiv LLMs · June 10, 2026

Which Models Are Our Models Built On? Auditing Invisible Dependencies in Modern LLMs

ModSleuth is introduced to automatically reconstruct complex dependency graphs across modern LLM pipelines by extracting and verifying model dependencies from public artifacts. This reveals hidden dependencies, license obligations, and documentation inconsistencies in LLM development.
Signal strength3.4/5·arXiv LLMs

ModSleuth is introduced to automatically reconstruct complex dependency graphs across modern LLM pipelines by extracting and verifying model dependencies from public artifacts. This reveals hidden dependencies, license obligations, and documentation inconsistencies in LLM development.

TL;DR

ModSleuth is introduced to automatically reconstruct complex dependency graphs across modern LLM pipelines by extracting and verifying model dependencies from public artifacts. This reveals hidden dependencies, license obligations, and documentation inconsistencies in LLM development.

What happened

Researchers developed ModSleuth, an agentic system that recursively identifies and verifies direct and indirect dependencies among large language models from publicly available sources. Applying it to four major LLM releases, they uncovered over a thousand source-verified dependencies, exposing complex multi-hop relationships and discrepancies between released and training-time artifacts.

Why it matters

Understanding hidden and recursive model dependencies is critical for transparency, legal compliance, and reproducibility in LLM development, as well as ensuring reliable evaluation and licensing adherence.

Generating deep dive...

AI-powered analysis takes a few seconds