Datasets
Hymba, A New Architecture That Pushes The Limits Of Small LLMs
LLM To Create Training Data For Domain Generalization
Dataset Synthesis With LLM
Construction And Analysis Of The "TruthEval" Dataset To Expose LLM Weaknesses
Large Language Models
SportQA, A New Dataset That Measures The Comprehension Of Sports In Large Language Models
Large Language Models
Proposal For A New Evaluation Method For AI Assistants Based On Human Preferences
Large Language Models
New UrbanSARFloods Dataset Solves Flood Detection Challenges
Datasets
Persona Hub, A Large Dataset Built From 1 Billion Personas, Is Now Available!
Persona-driven Data Synthesis
[InfiMM-WebMath-40B] Improves The Mathematical Performance Of LLM With A Dataset Consisting Of 2.4 Billion Mathematical Documents!
Datasets
IndiBias, A New Dataset For Measuring India-specific Social Biases
Large Language Models
[EDAT24] Event-based Dataset Specialized For Manufacturing Operation Classification
Datasets
[JMMLU] Prompt Politeness Affects LLM Performance!
ChatGPT
Analog And Multimodal Manufacturing Data Sets Acquired On The Future Factory Platform
Datasets
OpenToM, A Benchmark For Evaluating Whether An LLM Has A "Theory Of Mind," Is Now Available!
Datasets
"BioPlanner" And "BIOPROT Dataset" Automate Experimental Protocols For Biological Research
Large Language Models
Investigation Of A Method To Continuously Authenticate Users With Mouse Movements
Machine Learning
Machine Learning System For Continuous Certification With New Datasets
Machine Learning