-
Kyle1668/labeled_alignment_discourse_v1
Viewer • Updated • 1.07k • 11 -
Kyle1668/alignment-classifier-documents-unlabeled
Viewer • Updated • 57.9k • 12 -
geodesic-research/anthropic-propensity-evals-human-written-refined
Viewer • Updated • 4.28k • 921 • 1 -
Kyle1668/sfm-finetuning-dataset-v1.5
Viewer • Updated • 306k • 9
Kyle O'Brien PRO
Kyle1668
AI & ML interests
pretraining, alignment, open-source
Recent Activity
updated
a model
about 20 hours ago
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2-DPO_mbt
published
a model
about 20 hours ago
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2-DPO_mbt
updated
a dataset
4 days ago
Kyle1668/fewshot-discourse-grounded-misalignment-evals