Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2407.00320

Papers - Training - Brier Score - Probabilistic Accuracy

LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published 10 days ago • 33

Papers - Math - TabMWP

LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published 10 days ago • 33

Planning-with-LLM

about 11 hours ago

Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published 13 days ago • 47
Adaptable Logical Control for Large Language Models

Paper • 2406.13892 • Published 19 days ago • 1
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented Generation

Paper • 2406.19215 • Published 11 days ago • 26
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models

Paper • 2405.14831 • Published May 23 • 2

ashawkey/LGM

Text-to-3D • Updated Jun 3 • 98
LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published 10 days ago • 33
DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging

Paper • 2407.01470 • Published 7 days ago • 4

How Do Large Language Models Acquire Factual Knowledge During Pretraining?

Paper • 2406.11813 • Published 21 days ago • 29
From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries

Paper • 2406.12824 • Published 20 days ago • 20
Tokenization Falling Short: The Curse of Tokenization

Paper • 2406.11687 • Published 21 days ago • 13
Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level

Paper • 2406.11817 • Published 21 days ago • 13

Relevant-Papers-Midterm

Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models

Paper • 2402.14848 • Published Feb 19 • 18
The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6 • 47
CRAG -- Comprehensive RAG Benchmark

Paper • 2406.04744 • Published Jun 7 • 38
Transformers meet Neural Algorithmic Reasoners

Paper • 2406.09308 • Published 25 days ago • 43

Papers - Llama 3 - Fine-tuning

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

Paper • 2404.14047 • Published Apr 22 • 38
LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published 10 days ago • 33

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

Paper • 2404.02575 • Published Apr 3 • 46
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18 • 52
SnapKV: LLM Knows What You are Looking for Before Generation

Paper • 2404.14469 • Published Apr 22 • 23
FlowMind: Automatic Workflow Generation with LLMs

Paper • 2404.13050 • Published Mar 17 • 32

Papers - Math - GSM8K

Training Verifiers to Solve Math Word Problems

Paper • 2110.14168 • Published Oct 27, 2021 • 4
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Paper • 2309.12284 • Published Sep 21, 2023 • 17
LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published 10 days ago • 33

Daily paper that is inspiring (abstract is enough)

World Model on Million-Length Video And Language With RingAttention

Paper • 2402.08268 • Published Feb 13 • 35
Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 77
Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15 • 91
FiT: Flexible Vision Transformer for Diffusion Model

Paper • 2402.12376 • Published Feb 19 • 48

Previous
1
2
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs