Kolmogorov-Arnold Network (KAN) Instead Of MLP To Improve Model Expressiveness And Performance 24/09/2024 Large Language Models
[RetrievalAttention] Improved Efficiency Of LLM For Processing Long Contexts! 19/09/2024 Large Language Models
Development Of LLM Chatbot Specialized For Multiple Choice Questions In Physics At Indian High School Level 09/09/2024 Large Language Models
Integration Of Large-scale Language Models In HCI Research And Ethical Issues 06/09/2024 Large Language Models
LLM Learning From Failures, Proposing A New Benchmark "COTERRORSET" 05/09/2024 Large Language Models
Visualizing The "inside Of The Head" Of A Language Model - The Internal Mechanism Of LLMs Revealed By The Knowledge Graph 03/09/2024 Computation And Language
Ferret-UI, A Multimodal Large-scale Language Model For Mobile UI 02/09/2024 Large Language Models
GMS: Revolutionizing Manufacturing With ChatGPT And Diffusion Models 28/08/2024 Manufacturing
[BitNet B1.58] Achieved Accuracy Better Than Llama By Expressing Model Parameters In Three Values! 27/08/2024 Large Language Models
Google's High-performance LLM That Compresses Very Long Prompt Sentences To Save Memory 27/08/2024 Large Language Models
FABLES, A Dataset For Book Summarization Consisting Only Of Long Sentences Of 100k Tokens Or More, Is Now Available! 23/08/2024 Large Language Models
A Platform For Assessing LLMs' Collaborative Behavior And Ability To Manage Shared Resources Is Now Available! 22/08/2024 Simulation Platform
From DNA Analysis To Gene Expression Prediction And Large-scale Language Modeling For Bioinformatics 21/08/2024 Large Language Models
"AVI-Talking" Generates Natural 3D Talking Faces From Audio 17/08/2024 Face Recognition
IndiBias, A New Dataset For Measuring India-specific Social Biases 16/08/2024 Large Language Models
[BitNet] Large-scale Language Model With 1-bit Inference 06/08/2024 BitNet
Potential And Limitations Of Media Bias Detection Using ChatGPT 31/07/2024 Large Language Models
Improved Diagnostic Accuracy, New Diagnostic Support Through Medically Specialized LLM 31/07/2024 Large Language Models