Armen Jeddi

armenjeddi

https://armenjeddi.github.io/

AI & ML interests

VLMs, test-time scaling, post-training

Recent Activity

new activity 9 days ago

armenjeddi/MedBridgeRL-OctoMed-7B-PMC-VQA-RL:Add model card and metadata

authored a paper 15 days ago

When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains

submitted a paper 15 days ago

When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains

Organizations

None yet

upvoted a paper 15 days ago

When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains

Paper • 2603.01301 • Published 16 days ago • 8

upvoted a paper 26 days ago

KeDiff: Key Similarity-Based KV Cache Eviction for Long-Context LLM Inference in Resource-Constrained Environments

Paper • 2504.15364 • Published Apr 21, 2025 • 4

upvoted a paper about 1 month ago

LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation

Paper • 2602.11451 • Published Feb 11 • 15

upvoted 2 collections about 1 month ago

LoopFormer

Models trained in the ICLR2026 paper: LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation • 17 items • Updated 27 days ago • 2

PC-GRPO

Qwen2.5-VL-3B & 7B models trained with PC-GRPO in the paper: Puzzle Curriculum GRPO for Vision-Centric Reasoning • 9 items • Updated Feb 12 • 3

upvoted 3 papers 3 months ago

TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior

Paper • 2512.20757 • Published Dec 23, 2025 • 18

EasyV2V: A High-quality Instruction-based Video Editing Framework

Paper • 2512.16920 • Published Dec 18, 2025 • 18

Puzzle Curriculum GRPO for Vision-Centric Reasoning

Paper • 2512.14944 • Published Dec 16, 2025 • 36