Giving LLMs A Whiteboard To Write Down Their Reasoning Process Greatly Improves Their Visual Reasoning Ability! 26/12/2024 Prompting Method
Persona Hub, A Large Dataset Built From 1 Billion Personas, Is Now Available! 19/12/2024 Persona-driven Data Synthesis
An Evaluation Index To Quantify Social Bias In LLM Is Now Available! 11/12/2024 Social Bias
It Is Clear That Human Memory Characteristics Are Present In LLM! 27/09/2024 Large Language Models
A Paper Examining Whether LLMs Understand Cultural Common Sense Is Now Available! 27/09/2024 Cultural Commonsense
FABLES, A Dataset For Book Summarization Consisting Only Of Long Sentences Of 100k Tokens Or More, Is Now Available! 23/08/2024 Large Language Models
A Platform For Assessing LLMs' Collaborative Behavior And Ability To Manage Shared Resources Is Now Available! 22/08/2024 Simulation Platform
Models Reward Themselves And Train Themselves! 28/07/2024 Self Rewarding
[JMMLU] Prompt Politeness Affects LLM Performance! 26/07/2024 ChatGPT
SheetAgent, An LLM Agent That Automatically Performs Spreadsheet-based Tasks, Is Now Available! 28/05/2024 ChatGPT
A Framework For Simulating Cultural Evolution In Groups With LLM Is Now Available! 27/05/2024 Cultural Evolution
Benchmarks Are Now Available To Evaluate How Well AI Agents Are Able To Capture The Implicit Intentions Of Users! 27/05/2024 ChatGPT
The First Framework To Utilize LLM To Detect Fake News Is Now Available! 26/05/2024 Fakenews
OpenToM, A Benchmark For Evaluating Whether An LLM Has A "Theory Of Mind," Is Now Available! 24/05/2024 Datasets
Can LLM Recreate A Persona Based On The Big Five! 23/05/2024 ChatGPT
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI! 22/04/2024 ChatGPT
EmotionBench, A Framework For Quantifying LLM Emotions, Is Now Available! 19/04/2024 ChatGPT
A Framework Is Now Available That Brings Out Performance Beyond That Of GPT-4 By Allowing Diverse Agents To Debate Each Other! 12/10/2023 Agent Simulation