Citaman (Anthonny OLIME)

upvoted an article 3 months ago

Article

Token Merging for fast LLM inference : Background and first trials with Mistral

By

•

Apr 30, 2024

• 3

upvoted a paper 5 months ago

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12, 2024 • 65

upvoted an article 6 months ago

Article

How I train a LoRA: m3lt style training overview

By

•

Jul 1, 2024

• 48

upvoted a paper 6 months ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 96

upvoted 2 papers 7 months ago

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Paper • 2406.08973 • Published Jun 13, 2024 • 86

Needle In A Multimodal Haystack

Paper • 2406.07230 • Published Jun 11, 2024 • 53

upvoted a collection 7 months ago

Universal token classification

Collection

Collection of universal token classification (UTC) models capable in prompt-tuned manner to solve many information extraction tasks. • 11 items • Updated Sep 10, 2024 • 12

upvoted 3 papers 7 months ago

upvoted an article 7 months ago

Article

GPU Poor Savior: Revolutionizing Low-Bit Open Source LLMs and Cost-Effective Edge Computing

By

•

May 25, 2024

• 10

upvoted 2 articles 9 months ago

Article

Transformers

By

•

Jul 2, 2024

• 6

Article

Diffusion Models

By

•

May 19, 2024

• 14

upvoted 6 papers 9 months ago

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26, 2024 • 78

Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction

Paper • 2403.18795 • Published Mar 27, 2024 • 18

Long-form factuality in large language models

Paper • 2403.18802 • Published Mar 27, 2024 • 24

ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion

Paper • 2403.18818 • Published Mar 27, 2024 • 24

ViTAR: Vision Transformer with Any Resolution

Paper • 2403.18361 • Published Mar 27, 2024 • 52

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Paper • 2403.18814 • Published Mar 27, 2024 • 45

upvoted a collection 9 months ago

MGM

Collection

Official model collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" • 13 items • Updated May 3, 2024 • 47

Anthonny OLIME

AI & ML interests

Recent Activity

Organizations

Citaman's activity

Token Merging for fast LLM inference : Background and first trials with Mistral

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

How I train a LoRA: m3lt style training overview

Scaling Synthetic Data Creation with 1,000,000,000 Personas

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Needle In A Multimodal Haystack

Universal token classification

Yuan 2.0-M32: Mixture of Experts with Attention Router

ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models

Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

GPU Poor Savior: Revolutionizing Low-Bit Open Source LLMs and Cost-Effective Edge Computing

Transformers

Diffusion Models

The Unreasonable Ineffectiveness of the Deeper Layers

Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction

Long-form factuality in large language models

ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion

ViTAR: Vision Transformer with Any Resolution

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

MGM