leonardlin's Collections TOREAD
A Survey on Data Selection for Language Models
Paper
• 2402.16827
• Published
• 4
Instruction Tuning with Human Curriculum
Paper
• 2310.09518
• Published
• 3
Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs
Paper
• 2312.05934
• Published
• 1
Language Models as Agent Models
Paper
• 2212.01681
• Published
Beyond Language Models: Byte Models are Digital World Simulators
Paper
• 2402.19155
• Published
• 53
StarCoder 2 and The Stack v2: The Next Generation
Paper
• 2402.19173
• Published
• 154
Polaris: A Safety-focused LLM Constellation Architecture for Healthcare
Paper
• 2403.13313
• Published
• 2
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
Paper
• 2401.02731
• Published
• 3
On the Measure of Intelligence
Paper
• 1911.01547
• Published
• 5
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?
Paper
• 2405.05904
• Published
• 6
A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers
Paper
• 2405.10936
• Published
• 1
Human-like Episodic Memory for Infinite Context LLMs
Paper
• 2407.09450
• Published
• 62
MUSCLE: A Model Update Strategy for Compatible LLM Evolution
Paper
• 2407.09435
• Published
• 23
The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines
Paper
• 2408.01050
• Published
• 9
OpenResearcher: Unleashing AI for Accelerated Scientific Research
Paper
• 2408.06941
• Published
• 32
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper
• 2408.06292
• Published
• 128
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Paper
• 2408.02085
• Published
• 19
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper
• 2408.03314
• Published
• 63