67 10 5

Yang

Kaichengalex

https://kaicheng-yang0828.github.io/

Kaicheng-Yang0828

AI & ML interests

None yet

Recent Activity

liked a dataset 19 days ago

embedding-data/QQP_triplets

liked a dataset 19 days ago

TripletCLIP/TripletCLIP-High-Quality

upvoted a paper 24 days ago

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

View all activity

Organizations

Kaichengalex's activity

liked 2 datasets 19 days ago

embedding-data/QQP_triplets

Viewer • Updated Aug 2, 2022 • 102k • 338 • 8

TripletCLIP/TripletCLIP-High-Quality

Viewer • Updated Oct 27 • 1.4M • 146 • 4

upvoted a paper 24 days ago

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

Paper • 2411.18203 • Published 26 days ago • 30

upvoted a paper about 1 month ago

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published Nov 21 • 41

authored a paper about 1 month ago

ORID: Organ-Regional Information Driven Framework for Radiology Report Generation

Paper • 2411.13025 • Published Nov 20 • 2

upvoted a paper about 1 month ago

ORID: Organ-Regional Information Driven Framework for Radiology Report Generation

Paper • 2411.13025 • Published Nov 20 • 2

commented a paper about 1 month ago

ORID: Organ-Regional Information Driven Framework for Radiology Report Generation

Paper • 2411.13025 • Published Nov 20 • 2 •

updated 4 models about 1 month ago

upvoted 7 papers about 1 month ago

Unicom: Universal and Compact Representation Learning for Image Retrieval

Paper • 2304.05884 • Published Apr 12, 2023 • 2

RWKV-CLIP: A Robust Vision-Language Representation Learner

Paper • 2406.06973 • Published Jun 11 • 1

High-Fidelity Facial Albedo Estimation via Texture Quantization

Paper • 2406.13149 • Published Jun 19 • 2

Multi-label Cluster Discrimination for Visual Representation Learning

Paper • 2407.17331 • Published Jul 24 • 2

Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension

Paper • 2410.14332 • Published Oct 18 • 1

CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination

Paper • 2408.09441 • Published Aug 18 • 2

ALIP: Adaptive Language-Image Pre-training with Synthetic Caption

Paper • 2308.08428 • Published Aug 16, 2023 • 1

authored a paper about 2 months ago

Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension

Paper • 2410.14332 • Published Oct 18 • 1

liked a model about 2 months ago

royokong/e5-v

Image-Text-to-Text • Updated Oct 31 • 2.92k • 18