38 488 5932

Diwank Tomer PRO

diwank

https://diwank.name

AI & ML interests

None yet

Recent Activity

liked a dataset 3 days ago

facebook/research-plan-gen

liked a model 4 days ago

tencent/HY-MT1.5-1.8B

liked a model 4 days ago

tencent/WeDLM-8B-Base

View all activity

Organizations

upvoted a paper 14 days ago

Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published 17 days ago • 24

upvoted a collection 14 days ago

Gemma Scope 2

Collection

11 items • Updated 17 days ago • 15

upvoted a paper 16 days ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published 19 days ago • 59

upvoted an article about 1 month ago

Article

Norm-Preserving Biprojected Abliteration

Nov 6, 2025

•

upvoted a paper about 1 month ago

Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 53

upvoted a paper about 2 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 105

upvoted 2 collections about 2 months ago

The Bestiary

Collection

Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated Nov 16, 2025 • 77

Nemotron RAG

Collection

14 items • Updated 13 days ago • 54

upvoted a paper about 2 months ago

Drax: Speech Recognition with Discrete Flow Matching

Paper • 2510.04162 • Published Oct 5, 2025 • 27

upvoted 3 papers 2 months ago

upvoted 2 articles 2 months ago

Article

What makes good reasoning data

Oct 30, 2025

•

Article

Projected Abliteration

Oct 25, 2025

•

upvoted 6 papers 3 months ago

The Markovian Thinker

Paper • 2510.06557 • Published Oct 8, 2025 • 30

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 174

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 538

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30, 2025 • 55

Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training

Paper • 2509.25758 • Published Sep 30, 2025 • 22

DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively

Paper • 2509.26603 • Published Sep 30, 2025 • 16

Diwank Tomer PRO

AI & ML interests

Recent Activity

Organizations

diwank's activity

Norm-Preserving Biprojected Abliteration

What makes good reasoning data

Projected Abliteration