Anthony W Figueroa

THEFIG

AI & ML interests

None yet

Recent Activity

liked a Space 12 days ago

Qwen/Qwen2.5-Coder-Artifacts

upvoted a collection 20 days ago

SmolLM2

liked a model about 1 month ago

facebook/sam2.1-hiera-large

Organizations

None yet

THEFIG's activity

upvoted a collection 20 days ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated 3 days ago • 177

upvoted an article 2 months ago

Our Transformers Code Agent beats the GAIA benchmark!

Article • Jul 1 • 46

upvoted 2 collections 5 months ago

Models Used in HackerNoon Publishing System

Collection

HackerNoon.com’s content management system empowers a small team to manage tens of thousands of writers, advertisers, & millions of readers 🙏 🤖 🙏🤖 • 14 items • Updated Sep 23 • 21

OpenCodeInterpreter

Collection

18 items • Updated Mar 3 • 82

upvoted an article 5 months ago

Train custom AI models with the trainer API and adapt them to 🤗

Article • Jun 29 • 33

upvoted a collection 5 months ago

Gemma 2 Release

Collection

15 items • Updated Sep 9 • 197

upvoted a paper 6 months ago

Imp: Highly Capable Large Multimodal Models for Mobile Devices

Paper • 2405.12107 • Published May 20 • 25

upvoted a collection 8 months ago

OpenCulture

Collection

A multilingual dataset of public domain books and newspapers. • 27 items • Updated 18 days ago • 117

upvoted a paper 8 months ago

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14 • 124

upvoted 3 papers 9 months ago

Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5 • 93

Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

Paper • 2403.02677 • Published Mar 5 • 16

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Paper • 2401.10774 • Published Jan 19 • 54

upvoted 2 papers 10 months ago

BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation

Paper • 2401.17053 • Published Jan 30 • 31

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Paper • 2401.14196 • Published Jan 25 • 47

upvoted a paper 11 months ago

MobileVLM : A Fast, Reproducible and Strong Vision Language Assistant for Mobile Devices

Paper • 2312.16886 • Published Dec 28, 2023 • 19

upvoted a paper 12 months ago

TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 37

upvoted 4 papers about 1 year ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122

Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models

Paper • 2310.13671 • Published Oct 20, 2023 • 18

4K4D: Real-Time 4D View Synthesis at 4K Resolution

Paper • 2310.11448 • Published Oct 17, 2023 • 36

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Paper • 2310.11441 • Published Oct 17, 2023 • 26