10086 14 206

Tien Dung

tiendung

tiendung

AI & ML interests

None yet

Recent Activity

liked a model 13 days ago

BAAI/bge-large-zh-v1.5

liked a Space about 2 months ago

Qwen/QVQ-72B-preview

updated a Space 3 months ago

Symato/tomtat

View all activity

Organizations

tiendung's activity

liked a model 13 days ago

BAAI/bge-large-zh-v1.5

Feature Extraction • Updated Apr 2, 2024 • 208k • • 484

liked a Space about 2 months ago

555

QVQ 72B Preview

🌍

Upload images and ask questions to get answers

updated a Space 3 months ago

Tóm Tắt dot AI

📉

Tóm tắt và chat với các nội dung từ các links được cung cấp

liked a dataset 4 months ago

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated Nov 1, 2024 • 1.05M • 9.14k • 431

updated a collection 4 months ago

RAG

Collection

liked 2 models 4 months ago

5CD-AI/ColVintern-1B-v1

Feature Extraction • Updated Nov 14, 2024 • 109 • 6

ltg/gpt-bert-babylm-base

Updated 2 days ago • 6.58k • 6

liked 3 datasets 4 months ago

upvoted a paper 4 months ago

Unifying Multimodal Retrieval via Document Screenshot Embedding

Paper • 2406.11251 • Published Jun 17, 2024 • 10

liked a model 4 months ago

MrLight/dse-qwen2-2b-mrl-v1

Visual Document Retrieval • Updated 15 days ago • 8.62k • 53

liked a Space 4 months ago

Tóm Tắt dot AI

📉

Tóm tắt và chat với các nội dung từ các links được cung cấp

updated a collection 4 months ago

RAG

Collection

liked a dataset 4 months ago

5CD-AI/Vietnamese-THUIR-T2Ranking-gg-translated

Viewer • Updated Jun 5, 2024 • 361M • 1.26k • 19

upvoted a collection 4 months ago

new architecture

Collection

20 items • Updated Dec 17, 2024 • 3

upvoted a paper 4 months ago

GPT or BERT: why not both?

Paper • 2410.24159 • Published Oct 31, 2024 • 14

reacted to singhsidhukuldeep's post with 👀 4 months ago

Post

2116

Exciting Research Alert: Revolutionizing Dense Passage Retrieval with Entailment Tuning!

The good folks at HKUST have developed a novel approach that significantly improves information retrieval by leveraging natural language inference.

The entailment tuning approach consists of several key steps to enhance dense passage retrieval performance.

Data Preparation
- Convert questions into existence claims using rule-based transformations.
- Combine retrieval data with NLI data from SNLI and MNLI datasets.
- Unify the format of both data types using a consistent prompting framework.

Entailment Tuning Process
- Initialize the model using pre-trained language models like BERT or RoBERTa.
- Apply aggressive masking (β=0.8) specifically to the hypothesis components while preserving premise information.
- Train the model to predict the masked hypothesis tokens from the premise content.
- Run the training for 10 epochs using 8 GPUs, taking approximately 1.5-3.5 hours.

Training Arguments for Entailment Tuning (Yes! They Shared Them)
- Use a learning rate of 2e-5 with 100 warmup steps.
- Set batch size to 128.
- Apply weight decay of 0.01.
- Utilize the Adam optimizer with beta values (0.9, 0.999).
- Maintain maximum gradient norm at 1.0.

Deployment
- Index passages using FAISS for efficient retrieval.
- Shard vector store across multiple GPUs.
- Enable sub-millisecond retrieval of the top-100 passages per query.

Integration with Existing Systems
- Insert entailment tuning between pre-training and fine-tuning stages.
- Maintain compatibility with current dense retrieval methods.
- Preserve existing contrastive learning approaches during fine-tuning.

Simple, intuitive, and effective!

This advancement significantly improves the quality of retrieved passages for question-answering systems and retrieval-augmented generation tasks.

updated a collection 4 months ago

Vietnamese LLMs

Collection

The good ones • 12 items • Updated Nov 2, 2024