-
Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies
Paper • 2502.02533 • Published • 4 -
Self-Adapting Language Models
Paper • 2506.10943 • Published • 7 -
Small Language Models for Efficient Agentic Tool Calling: Outperforming Large Models with Targeted Fine-tuning
Paper • 2512.15943 • Published • 3
Sukesh Perla
hitchhiker3010
AI & ML interests
None yet
Recent Activity
upvoted
a
collection
about 12 hours ago
Environment Hub
reacted
to
sergiopaniego's
post
with 🔥
about 18 hours ago
New TRL + OpenEnv example! 💥
Fine tune an LLM for playing Sudoku using an RL env via OpenEnv
Includes a script that runs on 1 or multiple GPUs with vLLM, plus a Colab-ready notebook.
Enjoy!
Notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/openenv_sudoku_grpo.ipynb
Script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/sudoku.py
upvoted
an
article
3 days ago
Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective