This work addresses a domain-specific gap in LLM capabilities by providing both a targeted dataset and model fine-tuning method, enhancing precision and affective-semantic comprehension in classical poetry, a challenging niche for general LLMs.

Implementation guide

Improving automated translation, interpretation, and emotional analysis of classical Chinese poetry for academic, educational, or cultural tools.
Consider leveraging the CCPoetry-49K dataset and LoRA fine-tuning approach on Qwen2.5 to develop or improve domain-specific LLM applications in classical literature or cultural heritage.

Agents

Relevance

3.4/5

TAHOE: Text-to-SQL with Automated Hint Optimization from Experience

Impact: MediumTarget: Dev

Authored by arXiv Agents

TAHOE: Text-to-SQL with Automated Hint Optimization from Experience

Executive summary

Tahoe is a system that improves Text-to-SQL performance by dynamically optimizing hints derived from user and compiler feedback without needing model fine-tuning.

Technical implication

Tahoe enables more robust and adaptable Text-to-SQL applications in production by reducing dependence on costly fine-tuning and efficiently handling dialects, schemas, and user preferences through learned hints.

Implementation guide

Deploying Text-to-SQL interfaces in real-world environments that require handling diverse SQL dialects, complex schemas, and evolving user intents with improved accuracy without retraining large models.
Incorporate dynamic hint optimization pipelines like Tahoe to improve Text-to-SQL systems’ robustness and accuracy without costly model updates.

LLMs

Relevance

3.4/5

Which Models Are Our Models Built On? Auditing Invisible Dependencies in Modern LLMs

Impact: MediumTarget: Dev

Authored by arXiv LLMs

Which Models Are Our Models Built On? Auditing Invisible Dependencies in Modern LLMs

Executive summary

ModSleuth is introduced to automatically reconstruct complex dependency graphs across modern LLM pipelines by extracting and verifying model dependencies from public artifacts. This reveals hidden dependencies, license obligations, and documentation inconsistencies in LLM development.

Technical implication

Understanding hidden and recursive model dependencies is critical for transparency, legal compliance, and reproducibility in LLM development, as well as ensuring reliable evaluation and licensing adherence.

Implementation guide

Use ModSleuth to audit and visualize dependency graphs of LLMs in projects, enabling developers and organizations to verify source provenance, detect hidden license chains, and reconcile documentation inconsistencies.
Incorporate ModSleuth or similar tools to systematically track and verify model dependencies throughout LLM training and deployment pipelines to ensure transparency and compliance.

Agents

Relevance

3.4/5

APPO: Agentic Procedural Policy Optimization

Impact: MediumTarget: Dev

Authored by arXiv Agents

APPO: Agentic Procedural Policy Optimization

Executive summary

APPO is a novel reinforcement learning method that improves agent decision making by fine-grained credit assignment at token-level branches rather than coarse tool-call boundaries.

Technical implication

This approach enables more precise and interpretable reinforcement learning for multi-turn language model agents interacting with tools, overcoming shortcomings of heuristic-based credit assignment and improving downstream task performance.

Implementation guide

Enhancing RL training of language model-based agents in complex tool-using tasks like multi-step reasoning, interactive assistance, or procedural workflows.
Consider applying APPO's fine-grained branching and credit assignment methods to improve RL agent training efficiency and performance in multi-turn tool use settings.

Agents

Relevance

2.9/5

SPEA2$^+$: Improved Density Estimation in SPEA2 with Provable Runtime Guarantees

Impact: MediumTarget: Dev

Authored by arXiv Agents

SPEA2$^+$: Improved Density Estimation in SPEA2 with Provable Runtime Guarantees

Executive summary

This paper analyzes the runtime of the SPEA2 evolutionary algorithm and proposes an improved variant, SPEA2+, which provides better diversity handling and runtime guarantees on benchmark problems.

Technical implication

Improved theoretical understanding and performance guarantees of SPEA2+ enhance reliable multi-objective optimisation, which is crucial for AI systems requiring balanced optimization across multiple criteria, improving optimization quality and efficiency.

Implementation guide

Use SPEA2+ for multi-objective optimization tasks in AI and machine learning workflows where efficient coverage of Pareto fronts and maintaining solution diversity are critical, such as hyperparameter tuning or model architecture search.
Consider adopting SPEA2+ over SPEA2 for multi-objective optimization tasks to achieve provable runtime guarantees and better solution diversity, especially in complex problem settings.

1…9 10

Decoding the Next Frequencyof Artificial Intelligence.

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

System Report for CCL25-Eval Task 5: New Dataset and LoRA-Fine-Tuned Qwen2.5

TAHOE: Text-to-SQL with Automated Hint Optimization from Experience

Which Models Are Our Models Built On? Auditing Invisible Dependencies in Modern LLMs

APPO: Agentic Procedural Policy Optimization

SPEA2$^+$: Improved Density Estimation in SPEA2 with Provable Runtime Guarantees

Decoding the Next Frequency
of Artificial Intelligence.