- Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training (arXiv:2405.15319)
- Aya 23: Open Weight Releases to Further Multilingual Progress (arXiv:2405.15032)
- Transformers Can Do Arithmetic with the Right Embeddings (arXiv:2405.17399)
- Yuan 2.0-M32: Mixture of Experts with Attention Router (arXiv:2405.17976)
- LLM Augmented LLMs: Expanding Capabilities through Composition (arXiv:2401.02412, published Jan 4, 2024)
- FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation (arXiv:2310.03214, published Oct 5, 2023)
- Meta-Transformer: A Unified Framework for Multimodal Learning (arXiv:2307.10802, published Jul 20, 2023)