arxiv:2406.08414
Alex J. Chan
XanderJC
AI & ML interests
None yet
Recent Activity
authored
a paper
3 days ago
How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking
Unrelated Questions
authored
a paper
3 days ago
Dense Reward for Free in Reinforcement Learning from Human Feedback
updated
a model
4 months ago
XanderJC/sft-llava-1.5-7b-hf
Organizations
Papers
3
models
12
XanderJC/sft-llava-1.5-7b-hf
Image-Text-to-Text
•
Updated
•
7
XanderJC/sft_openassistant-guanaco
Text Generation
•
Updated
•
12
XanderJC/llama-3-8b-orca-abc
Reinforcement Learning
•
Updated
•
2
XanderJC/llama-3-8b-orca-rlhf
Reinforcement Learning
•
Updated
•
2
XanderJC/llama-3-8b-orca-rm
Updated
•
1
XanderJC/phi2-sft-tldr-merged
Text Generation
•
Updated
•
10
XanderJC/phi2-sft-tldr
Updated
•
8
XanderJC/gptj-rm-tldr-merged
Text Classification
•
Updated
•
7
XanderJC/gptj-sft-tldr-merged
Text Generation
•
Updated
•
13
XanderJC/gptj-sft-tldr
Updated
•
7
datasets
None public yet