1 2

Mehul Damani PRO

mehuldamani

https://damanimehul.github.io

AI & ML interests

Reinforcement Learning, Large Language Models

Recent Activity

published a model about 9 hours ago

mehuldamani/fromRLVR_qwen3_8b_medical_rlcr_multiple

published a model about 17 hours ago

mehuldamani/format_train_rlvr_qwen3_8b_medical_rlcr_multiple

published a model about 21 hours ago

mehuldamani/qwen3_8b_medical_rlcr_multiple_zeroBrierWeight

View all activity

Organizations

None yet

Collections 1

Papers 4

models 164

mehuldamani/qwen3_8b_medical_rlcr_multiple_moreThanOneCorrectnessPoint_0BrierWeight

Updated about 23 hours ago

mehuldamani/qwen3_8b_medical_rlcr_multiple_moreThanOneCorrectnessPoint_lowerBrierWeightPoint1

Updated 1 day ago

mehuldamani/qwen3_8b_medical_rlvr_multi_moreThanOneCorrectnessPoint1

Updated 2 days ago

mehuldamani/qwen3_8b_medical_rlcr_multiple_moreThanOneCorrectnessPoint1

Updated 2 days ago

mehuldamani/qwen3_8b_medical_rlvr_multi_moreThanOneCorrectnessPoint

Updated 3 days ago

mehuldamani/qwen3_8b_medical_rlvr_multi

Updated 8 days ago

mehuldamani/qwen3_8b_medical_rlcr_single_judgeYesPartialCredit

Updated 9 days ago

View 164 models

datasets 47

mehuldamani/medDataset_25k

Viewer • Updated 10 days ago • 75k • 139

mehuldamani/medDataset

Viewer • Updated 11 days ago • 1.29M • 94

mehuldamani/qwen3_8b_ambigQA_rlcr_multi_analysis

Viewer • Updated 13 days ago • 2k • 17

mehuldamani/qwen3_8b_ambigQA_rlcr_single_passk_tryAgain

Viewer • Updated 14 days ago • 2k • 10

mehuldamani/ambigQA

Viewer • Updated 17 days ago • 12k • 93

mehuldamani/judge-new-sft-instruct

Viewer • Updated 29 days ago • 100 • 9

mehuldamani/judge-new-sft-base

Viewer • Updated 29 days ago • 100 • 13

mehuldamani/judge-new-instruct

Viewer • Updated 29 days ago • 100 • 15

mehuldamani/judge-new-sft

Viewer • Updated 29 days ago • 100 • 16

mehuldamani/judge-new-base

Viewer • Updated 29 days ago • 100 • 13

View 47 datasets

Mehul Damani PRO

AI & ML interests

Recent Activity

Organizations

Collections 1

mehuldamani/big-math-digits-v2-correctness

mehuldamani/hotpot-v2-correctness-7b

mehuldamani/orm-big-math-digits-v2-correctness

mehuldamani/big-math-digits-v2-brier

mehuldamani/big-math-digits-v2-correctness

mehuldamani/hotpot-v2-correctness-7b

mehuldamani/orm-big-math-digits-v2-correctness

mehuldamani/big-math-digits-v2-brier

Papers 4

models 164

mehuldamani/fromRLVR_qwen3_8b_medical_rlcr_multiple

mehuldamani/format_train_rlvr_qwen3_8b_medical_rlcr_multiple

mehuldamani/qwen3_8b_medical_rlcr_multiple_zeroBrierWeight

mehuldamani/qwen3_8b_medical_rlcr_multiple_moreThanOneCorrectnessPoint_0BrierWeight

mehuldamani/qwen3_8b_medical_rlcr_multiple_moreThanOneCorrectnessPoint_lowerBrierWeightPoint1

mehuldamani/qwen3_8b_medical_rlvr_multi_moreThanOneCorrectnessPoint1

mehuldamani/qwen3_8b_medical_rlcr_multiple_moreThanOneCorrectnessPoint1

mehuldamani/qwen3_8b_medical_rlvr_multi_moreThanOneCorrectnessPoint

mehuldamani/qwen3_8b_medical_rlvr_multi

mehuldamani/qwen3_8b_medical_rlcr_single_judgeYesPartialCredit

datasets 47

mehuldamani/medDataset_25k

mehuldamani/medDataset

mehuldamani/qwen3_8b_ambigQA_rlcr_multi_analysis

mehuldamani/qwen3_8b_ambigQA_rlcr_single_passk_tryAgain

mehuldamani/ambigQA

mehuldamani/judge-new-sft-instruct

mehuldamani/judge-new-sft-base

mehuldamani/judge-new-instruct

mehuldamani/judge-new-sft

mehuldamani/judge-new-base

Mehul Damani PRO

AI & ML interests

Recent Activity

Organizations

Collections 1

Papers 4

models 164 Sort: Recently updated

datasets 47 Sort: Recently updated

models 164

datasets 47