arxiv:2402.13991
Szymon Tworkowski
syzymon
AI & ML interests
Language models, theorem proving and much more!
Recent Activity
authored
a paper
about 1 month ago
Magnushammer: A Transformer-based Approach to Premise Selection
authored
a paper
about 1 month ago
Structured Packing in LLM Training Improves Long Context Utilization
authored
a paper
about 1 month ago
Hierarchical Transformers Are More Efficient Language Models
Organizations
None yet
models
5
syzymon/long_llama_code_7b_instruct
Text Generation
•
Updated
•
46
•
11
syzymon/long_llama_code_7b
Text Generation
•
Updated
•
36
•
31
syzymon/long_llama_3b
Text Generation
•
Updated
•
127
•
120
syzymon/long_llama_3b_instruct
Text Generation
•
Updated
•
606
•
25
syzymon/long_llama_3b_v1_1
Text Generation
•
Updated
•
24
•
10
datasets
None public yet