12 66 87

Sylvestre Bcht

Sylvestre

Kakulukian

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

zed-industries/zeta

reacted to vikhyatk's post with 🔥 2 days ago

🚨 New VQA + captioning dataset! https://huggingface.co/datasets/moondream/megalith-mdqa Images from Megalith, captioned using Moondream, then transformed to short-form QA. 9M+ images, 6-10 QA pairs per image.

updated a collection 23 days ago

test

View all activity

Organizations

Sylvestre's activity

upvoted 2 papers about 1 month ago

SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images

Paper • 2501.04689 • Published Jan 8 • 17

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published Jan 7 • 42

upvoted a paper about 2 months ago

Structured 3D Latents for Scalable and Versatile 3D Generation

Paper • 2412.01506 • Published Dec 2, 2024 • 60

upvoted a paper 4 months ago

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Paper • 2410.12628 • Published Oct 16, 2024 • 34

upvoted 2 collections 4 months ago

DocLayout-YOLO

Collection

Dataset and model for DocLayout-YOLO • 10 items • Updated Jan 14 • 13

Granite 3.0 Language Models

Collection

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated Dec 18, 2024 • 96

upvoted an article 6 months ago

Article

Deprecation of Git Authentication using password

Aug 25, 2023

• 25

upvoted a paper 6 months ago

An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion

Paper • 2408.03178 • Published Aug 6, 2024 • 39

upvoted a paper 7 months ago

SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain

Paper • 2407.19584 • Published Jul 28, 2024 • 63

upvoted a collection 7 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 648

upvoted 10 papers 7 months ago

Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models

Paper • 2407.15642 • Published Jul 22, 2024 • 11

MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation

Paper • 2407.15060 • Published Jul 21, 2024 • 9

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Paper • 2407.15762 • Published Jul 22, 2024 • 10

Artist: Aesthetically Controllable Text-Driven Stylization without Training

Paper • 2407.15842 • Published Jul 22, 2024 • 14

HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions

Paper • 2407.15187 • Published Jul 21, 2024 • 12