UCL DARK

university

https://dark.cs.ucl.ac.uk/

UCL_DARK

ucl-dark

AI & ML interests

The UCL Deciding, Acting, and Reasoning with Knowledge (DARK) Lab is a Reinforcement Learning research group at the UCL Centre for Artificial Intelligence. We focus on research in complex open-ended environments that provide a constant stream of novel observations without reliable reward functions, often requiring agents to create their own curricula and to deal with external knowledge, natural language, and hard exploration problems.

UCL-DARK's activity

aengusl

authored a paper 3 months ago

Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs

Paper • 2407.15549 • Published Jul 22

rraileanu

authored a paper 3 months ago

Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources

Paper • 2409.08239 • Published Sep 12 • 16

rraileanu

authored a paper 10 months ago

Teaching Large Language Models to Reason with Reinforcement Learning

Paper • 2403.04642 • Published Mar 7 • 46

christofernal

authored a paper 10 months ago

Teaching Large Language Models to Reason with Reinforcement Learning

Paper • 2403.04642 • Published Mar 7 • 46

rraileanu

authored a paper 10 months ago

Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

Paper • 2402.16822 • Published Feb 26 • 15

robkirk

updated a Space 11 months ago

README

DanielCHTan97

updated a dataset 11 months ago

UCL-DARK/tqa_translate

Updated Jan 30 • 6

robkirk

authored a paper 11 months ago

Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks

Paper • 2311.12786 • Published Nov 21, 2023 • 2

robkirk

updated 5 datasets about 1 year ago

UCL-DARK/sequential-instructions

Viewer • Updated Oct 26, 2023 • 533 • 69 • 3

UCL-DARK/alpaca-farm-id-test

Viewer • Updated Oct 26, 2023 • 1.03k • 44

UCL-DARK/openai-tldr-filtered-queries

Viewer • Updated Oct 26, 2023 • 303k • 44

UCL-DARK/openai-tldr-summarisation-preferences

Viewer • Updated Oct 26, 2023 • 177k • 263 • 1

UCL-DARK/openai-tldr-filtered

Viewer • Updated Oct 26, 2023 • 130k • 88 • 1

robkirk

authored 2 papers about 1 year ago

Understanding the Effects of RLHF on LLM Generalisation and Diversity

Paper • 2310.06452 • Published Oct 10, 2023 • 2

Reward Model Ensembles Help Mitigate Overoptimization

Paper • 2310.02743 • Published Oct 4, 2023 • 1

rraileanu

authored a paper over 1 year ago

Chain-of-Verification Reduces Hallucination in Large Language Models

Paper • 2309.11495 • Published Sep 20, 2023 • 38

christofernal

authored 3 papers over 1 year ago

PEER: A Collaborative Language Model

Paper • 2208.11663 • Published Aug 24, 2022 • 2

Augmented Language Models: a Survey

Paper • 2302.07842 • Published Feb 15, 2023 • 3

Neurons in Large Language Models: Dead, N-gram, Positional

Paper • 2309.04827 • Published Sep 9, 2023 • 16

rraileanu

authored a paper over 1 year ago

Challenges and Applications of Large Language Models

Paper • 2307.10169 • Published Jul 19, 2023 • 47