-
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Paper • 2312.12742 • Published • 12 -
ProTIP: Progressive Tool Retrieval Improves Planning
Paper • 2312.10332 • Published • 7 -
Paloma: A Benchmark for Evaluating Language Model Fit
Paper • 2312.10523 • Published • 12 -
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
Paper • 2406.17557 • Published • 86
daje kang
daje
·
AI & ML interests
None yet
Recent Activity
liked
a model
about 1 hour ago
boltz-community/boltz-1
updated
a dataset
17 days ago
daje/Ko-SciecneQA
updated
a model
21 days ago
daje/llama3.1-8B-naver_news-summary-llamafactory
Organizations
None yet
Collections
1
models
18
daje/Qwen2-VL-72B-instruct-ScienceQA
Updated
daje/llama3.1-8B-naver_news-summary-llamafactory
Updated
•
6
daje/code-llama-7b-text-to-sql
Updated
daje/chapter5_code-llama3-8B-text-to-sql-ver0.1
Updated
daje/chapter5_psychological_chatbots
Updated
daje/20240830_model
Updated
daje/meta-llama3.1-8B-qna-koalpaca-v1.1
Text Generation
•
Updated
•
9
daje/model_output
Updated
daje/chinese_results_20240729_021938
Updated
daje/code-llama3-8B-text-to-sql-ver0.1
Text Generation
•
Updated
•
4
datasets
9
daje/Ko-SciecneQA
Viewer
•
Updated
•
12.7k
•
39
daje/keyword_summary
Viewer
•
Updated
•
1k
•
17
daje/kotext-to-sql-v1
Viewer
•
Updated
•
262k
•
36
daje/mistral_tokenized_en_wiki
Viewer
•
Updated
•
16.1M
•
164
daje/mistral_tokenized_ko_wiki
Viewer
•
Updated
•
1.7M
•
33
daje/tokenized_enwiki
Viewer
•
Updated
•
16.4M
•
178
daje/tokenized_kowiki
Viewer
•
Updated
•
1.71M
•
35
daje/en_wiki
Viewer
•
Updated
•
5.09M
•
386
daje/ko_wiki
Viewer
•
Updated
•
311k
•
56
•
6