Sukesh Perla's picture

Sukesh Perla

hitchhiker3010

·

AI & ML interests

None yet

Recent Activity

upvoted a collection about 12 hours ago

Environment Hub

reacted to sergiopaniego's post with 🔥 about 18 hours ago

New TRL + OpenEnv example! 💥 Fine tune an LLM for playing Sudoku using an RL env via OpenEnv Includes a script that runs on 1 or multiple GPUs with vLLM, plus a Colab-ready notebook. Enjoy! Notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/openenv_sudoku_grpo.ipynb Script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/sudoku.py

upvoted an article 3 days ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

View all activity

Organizations

Collections 8

View 8 collections

spaces 2

Token Visualizer

Visualize tokens from text using a tokenizer

Quickdraw

models 0

None public yet

datasets 0

None public yet