HumanRankEval: Automatic Evaluation of LMs as Conversational Assistants Paper • 2405.09186 • Published May 15, 2024
ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning Paper • 2406.19741 • Published Jun 28, 2024
EntityCS: Improving Zero-Shot Cross-lingual Transfer with Entity-Centric Code Switching Paper • 2210.12540 • Published Oct 22, 2022
PanGu-Coder: Program Synthesis with Function-Level Language Modeling Paper • 2207.11280 • Published Jul 22, 2022