COMETA: A Corpus for Medical Entity Linking in the Social Media Paper • 2010.03295 • Published Oct 7, 2020 • 2
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12 • 217
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent Paper • 2312.10003 • Published Dec 15, 2023 • 36
Axiomatic Preference Modeling for Longform Question Answering Paper • 2312.02206 • Published Dec 2, 2023 • 7
Reward models on the hub Collection UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF. • 18 items • Updated Apr 13 • 25
Improving Large Language Model Fine-tuning for Solving Math Problems Paper • 2310.10047 • Published Oct 16, 2023 • 5
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper • 2307.09288 • Published Jul 18, 2023 • 242
Lemur: Harmonizing Natural Language and Code for Language Agents Paper • 2310.06830 • Published Oct 10, 2023 • 31
GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors Paper • 2310.08529 • Published Oct 12, 2023 • 17
RoboCook: Long-Horizon Elasto-Plastic Object Manipulation with Diverse Tools Paper • 2306.14447 • Published Jun 26, 2023 • 6