Ke

Bingxin

AI & ML interests

Computer Vision; 3D Vision;

Recent Activity

upvoted a paper about 1 month ago
Video Depth without Video Models
authored a paper about 1 month ago
Video Depth without Video Models
updated a model about 1 month ago
prs-eth/rollingdepth-v1-0
View all activity

Organizations

Photogrammetry and Remote Sensing Lab of ETH Zurich's profile picture

Bingxin's activity

New activity in prs-eth/marigold-depth-v1-0 2 months ago

Update README.md

#11 opened 2 months ago by
wcs990791
New activity in prs-eth/marigold-depth-lcm-v1-0 2 months ago

Update README.md

#4 opened 2 months ago by
wcs990791
New activity in prs-eth/marigold-depth-lcm-v1-0 9 months ago

add fp16 checkpoints

#2 opened 9 months ago by
Bingxin
New activity in prs-eth/marigold-depth-v1-0 9 months ago

add fp16 checkpoints (new)

#5 opened 9 months ago by
Bingxin

add fp16 checkpoints

#4 opened 9 months ago by
Bingxin
reacted to osanseviero's post with 🤗 9 months ago
view post
Post
1618
Diaries of Open Source. Part 10 🚀

🌼Marigold-LCM: A super fast SOTA Depth Estimator
Demo: prs-eth/marigold-lcm
Original paper: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation (2312.02145)
Model: https://hf.co/prs-eth/marigold-lcm-v1-0

🌟Quiet-STaR: A self-teaching technique via internal monologue
Paper: Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking (2403.09629)
GitHub: https://github.com/ezelikman/quiet-star
Tweetutorial: https://twitter.com/ericzelikman/status/1768663835106513041

🖼️ WebSight v0.2: A image-to-code dataset containing tailwind CSS, images in screenshots, and more!
Dataset: HuggingFaceM4/WebSight
Paper: Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset (2403.09029)
Blog: https://hf.co/blog/websight

🕵️Agent-FLAN - effective agent tuning for LLMs
Paper: Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models (2403.12881)
Model: internlm/Agent-FLAN-7b
Dataset: internlm/Agent-FLAN
Website: https://internlm.github.io/Agent-FLAN/

🔥HPT, a family of multimodal LLMs from HyperGAI
Blog post: https://hypergai.com/blog/introducing-hpt-a-family-of-leading-multimodal-llms
Model: HyperGAI/HPT
GitHub: https://github.com/hyperGAI/HPT

🌏Models and datasets around the world
- Tess-70B, a MiQu-70B fine-tune with high-quality data migtissera/Tess-70B-v1.6
- UNI, a model trained on 100 million pathology images from 100k+ slides MahmoodLab/UNI
- CONCH, a VLM trained on 1.17 million pathology image-text pairs MahmoodLab/CONCH
·
New activity in prs-eth/marigold-depth-v1-0 about 1 year ago

safetensors

2
#1 opened about 1 year ago by
graphnull
liked a Space about 1 year ago