AgentBench, A Comprehensive Benchmark For Evaluating AI Agent Performance, Is Now Available! AgentBench, A Comprehensive Benchmark For Evaluating AI Agent Performance, Is Now Available! 21/09/2023 Agent Simulation
ChatEval, An Evaluation Framework That Allows AI Agents To Discuss With Each Other, Is Now Available! ChatEval, An Evaluation Framework That Allows AI Agents To Discuss With Each Other, Is Now Available ... 15/09/2023 Agent Simulation
MetaGPT, A Multi-agent Framework In Which AI Consistently Develops Systems, Is Now Available! MetaGPT, A Multi-agent Framework In Which AI Consistently Develops Systems, Is Now Available! 13/09/2023 Agent Simulation
CHATDEV, A Virtual Company Of AI Agents Developing Software! CHATDEV, A Virtual Company Of AI Agents Developing Software! 06/09/2023 Agent Simulation
Can Generative AI Be Applied To Research In Different Fields? Can Generative AI Be Applied To Research In Different Fields? 30/08/2023 ChatGPT
New "ToolQA" Dataset: Assesses The Ability Of Large Language Models To Solve Problems With External Tools New "ToolQA" Dataset: Assesses The Ability Of Large Language Models To Solve Problems With External ... 28/08/2023 Large Language Models
A Framework For Simulating Collaboration Between AI Agents And Others In A Virtual Environment Is Now Available! A Framework For Simulating Collaboration Between AI Agents And Others In A Virtual Environment Is No ... 25/08/2023 Agent Simulation
Is The Performance Of ChatGPT (GPT-3.5 And GPT-4) Changing? Stanford University And UC Berkeley Research Teams Investigate Is The Performance Of ChatGPT (GPT-3.5 And GPT-4) Changing? Stanford University And UC Berkeley Rese ... 23/08/2023 Large Language Models
What Part Of The Context Does The Large-scale Language Model Use? What Part Of The Context Does The Large-scale Language Model Use? 16/08/2023 Large Language Models
VLMaps To Improve Accuracy By Labeling Directly On 3D Maps VLMaps To Improve Accuracy By Labeling Directly On 3D Maps 09/08/2023 Robot
Can Large-scale Language Models Replace Humans In Text Evaluation Tasks? Can Large-scale Language Models Replace Humans In Text Evaluation Tasks? 02/08/2023 Large Language Models
What Impact Will ChatGPT Have On Our Lives, Business And Industry, And Education? What Impact Will ChatGPT Have On Our Lives, Business And Industry, And Education? 20/07/2023 ChatGPT
Large-scale Language Model Manipulates Android Applications! DroidBot-GPT, A New Tool To Automate Tasks Large-scale Language Model Manipulates Android Applications! DroidBot-GPT, A New Tool To Automate Ta ... 19/07/2023 Large Language Models
Are Humans Or Large-scale Language Models (ChatGPT, GPT-4) Better Instructors For Teaching Beginner Programming? Are Humans Or Large-scale Language Models (ChatGPT, GPT-4) Better Instructors For Teaching Beginner ... 14/07/2023 Large Language Models
A Framework Is Now Available That Allows Agents To Act Autonomously With Each Other Toward Task Completion! A Framework Is Now Available That Allows Agents To Act Autonomously With Each Other Toward Task Comp ... 03/07/2023 ChatGPT
Discover The ChatGPT's Mouth-watering Bias Depending On The Persona You Assign To It! Discover The ChatGPT's Mouth-watering Bias Depending On The Persona You Assign To It! 27/06/2023 ChatGPT