5 7 18

Xiao Liu

ShawLiu

https://github.com/xiao9905

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

updated a model 27 days ago

THUDM/webrl-orm-llama-3.1-8b

updated a model about 2 months ago

THUDM/webrl-llama-3.1-70b

View all activity

Organizations

ShawLiu's activity

upvoted a paper 16 days ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published 20 days ago • 16

updated a model 27 days ago

THUDM/webrl-orm-llama-3.1-8b

Updated 27 days ago • 42 • 1

updated a model about 2 months ago

THUDM/webrl-llama-3.1-70b

Updated Nov 12, 2024 • 17 • 4

liked a model 2 months ago

THUDM/webrl-llama-3.1-8b

Updated Nov 6, 2024 • 53 • 3

updated a model 2 months ago

THUDM/webrl-llama-3.1-8b

Updated Nov 6, 2024 • 53 • 3

liked a model 2 months ago

THUDM/webrl-glm-4-9b

Updated Nov 5, 2024 • 35 • 8

authored 14 papers 2 months ago

ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation

Paper • 2304.05977 • Published Apr 12, 2023 • 1

Self-supervised Learning: Generative or Contrastive

Paper • 2006.08218 • Published Jun 15, 2020

Black-Box Prompt Optimization: Aligning Large Language Models without Model Training

Paper • 2311.04155 • Published Nov 7, 2023 • 1

Language Models are Open Knowledge Graphs

Paper • 2010.11967 • Published Oct 22, 2020

CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation

Paper • 2311.18702 • Published Nov 30, 2023

P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks

Paper • 2110.07602 • Published Oct 14, 2021

SafetyBench: Evaluating the Safety of Large Language Models with Multiple Choice Questions

Paper • 2309.07045 • Published Sep 13, 2023

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Paper • 2308.14508 • Published Aug 28, 2023 • 2

Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments

Paper • 2402.14672 • Published Feb 22, 2024

GraphMAE: Self-Supervised Masked Graph Autoencoders

Paper • 2205.10803 • Published May 22, 2022