Costa Pissaris's picture

28 30

Costa Pissaris

somtimz

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Memory in the Age of AI Agents

upvoted a paper 2 months ago

Less is More: Recursive Reasoning with Tiny Networks

liked a Space 2 months ago

HuggingFaceFW/blogpost-fineweb-v1

View all activity

Organizations

upvoted a paper 4 days ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published 19 days ago • 124

upvoted a paper 2 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 501

upvoted a paper 3 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 228

upvoted an article 4 months ago

Article

Fine-tune Llama 3 with ORPO

Apr 22, 2024

•

241

upvoted a collection 6 months ago

Gemma 3n

4 items • Updated Jul 10, 2025 • 255

upvoted a collection 8 months ago

Self-improving LLMs

17 items • Updated Mar 27, 2025 • 2

upvoted a paper 8 months ago

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

Paper • 2502.01839 • Published Feb 3, 2025 • 10

upvoted a paper 9 months ago

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28, 2024 • 21

upvoted an article 9 months ago

Article

Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework

May 7, 2024

•

3

upvoted an article 10 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29, 2024

•

365

upvoted a collection 10 months ago

Gemma 3 Release

28 items • Updated Aug 11, 2025 • 577

upvoted an article about 1 year ago

Article

Let's talk about LLM evaluation

May 23, 2024

•

204

upvoted 2 papers over 1 year ago

RAG Does Not Work for Enterprises

Paper • 2406.04369 • Published May 31, 2024 • 1

Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Paper • 2406.04271 • Published Jun 6, 2024 • 29

upvoted a collection almost 2 years ago

Preference Datasets for DPO

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Dec 11, 2024 • 46

upvoted a paper almost 2 years ago

Mixtral of Experts

Paper • 2401.04088 • Published Jan 8, 2024 • 160

upvoted 4 papers about 2 years ago

WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation

Paper • 2312.14187 • Published Dec 20, 2023 • 49

Gemini: A Family of Highly Capable Multimodal Models

Paper • 2312.11805 • Published Dec 19, 2023 • 47

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 243

Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 77