Bo Wang

bwang0911

AI & ML interests

information retrieval, representation learning

Recent Activity

updated a dataset 24 days ago

jinaai/mtvqa

published a dataset 24 days ago

jinaai/mtvqa

new activity 27 days ago

jinaai/jina-clip-v2:compare with BLIP2

bwang0911's activity

upvoted a paper 2 months ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 106

upvoted a collection 3 months ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 6 days ago • 59

upvoted a paper 3 months ago

A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection

Paper • 2411.12946 • Published Nov 20, 2024 • 21

upvoted an article 5 months ago

A Short Summary of Chinese AI Global Expansion

Article • Oct 1, 2024 • 15

upvoted a collection 5 months ago

jina-embeddings-v3

Multilingual multi-task general text embedding model • 6 items • Updated Sep 19, 2024 • 20

upvoted a paper 5 months ago

jina-embeddings-v3: Multilingual Embeddings With Task LoRA

Paper • 2409.10173 • Published Sep 16, 2024 • 30

upvoted a collection 8 months ago

jina-clip

Multimodal text-image embeddings • 4 items • Updated Dec 14, 2024 • 10

upvoted a paper 8 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 91

upvoted a paper 9 months ago

Jina CLIP: Your CLIP Model Is Also Your Text Retriever

Paper • 2405.20204 • Published May 30, 2024 • 35

upvoted a paper 12 months ago

Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings

Paper • 2402.17016 • Published Feb 26, 2024 • 5

upvoted 2 papers about 1 year ago

Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents

Paper • 2310.19923 • Published Oct 30, 2023 • 14

Generalist embedding models are better at short-context clinical semantic search than specialized embedding models

Paper • 2401.01943 • Published Jan 3, 2024 • 6

upvoted a collection about 1 year ago

jina-embeddings-v2

The V2 family of Jina Embeddings supports encoding large documents with 8k sequence length. • 8 items • Updated Sep 17, 2024 • 15

upvoted a paper over 1 year ago

Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models

Paper • 2307.11224 • Published Jul 20, 2023 • 6