dhruva-sarma's Collections
Natural Language (LLM, NLP etc)
updated
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Paper
•
2404.12253
•
Published
•
54
FlowMind: Automatic Workflow Generation with LLMs
Paper
•
2404.13050
•
Published
•
33
How Far Can We Go with Practical Function-Level Program Repair?
Paper
•
2404.12833
•
Published
•
6
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models
Paper
•
2404.18796
•
Published
•
68
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
Paper
•
2405.01535
•
Published
•
119
An Introduction to Vision-Language Modeling
Paper
•
2405.17247
•
Published
•
87
From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries
Paper
•
2406.12824
•
Published
•
20
Tokenization Falling Short: The Curse of Tokenization
Paper
•
2406.11687
•
Published
•
15
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content
Paper
•
2406.11811
•
Published
•
16
GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks
Paper
•
2406.12925
•
Published
•
23
HARE: HumAn pRiors, a key to small language model Efficiency
Paper
•
2406.11410
•
Published
•
38
Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges
Paper
•
2406.12624
•
Published
•
36
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
Paper
•
2406.15319
•
Published
•
62
Octo-planner: On-device Language Model for Planner-Action Agents
Paper
•
2406.18082
•
Published
•
47
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
Paper
•
2406.18495
•
Published
•
12
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper
•
2406.20094
•
Published
•
96
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
Paper
•
2407.09025
•
Published
•
130
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
Paper
•
2407.13623
•
Published
•
53
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
Paper
•
2407.12854
•
Published
•
29
Building and better understanding vision-language models: insights and future directions
Paper
•
2408.12637
•
Published
•
124
Text2SQL is Not Enough: Unifying AI and Databases with TAG
Paper
•
2408.14717
•
Published
•
24
Generative Verifiers: Reward Modeling as Next-Token Prediction
Paper
•
2408.15240
•
Published
•
13
Towards a Unified View of Preference Learning for Large Language Models: A Survey
Paper
•
2409.02795
•
Published
•
71
LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Paper
•
2409.06666
•
Published
•
55
GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering
Paper
•
2409.06595
•
Published
•
37
Not All LLM Reasoners Are Created Equal
Paper
•
2410.01748
•
Published
•
28