Generative Model
Forget-Me-Not: A Proposal For A Simple Prompting Technique To Prevent Forgetting Information In Long Prompts
Forget-Me-Not: A Proposal For A Simple Prompting Technique To Prevent Forgetting Information In Long ...
Potential Of The Conversation Optimization Tokenizer: A Method To Improve LLM Inference Efficiency By 10%
Potential Of The Conversation Optimization Tokenizer: A Method To Improve LLM Inference Efficiency B ...
RoboTwin 2.0: Scalable Synthetic Data Generation And Benchmark Design For Dual-Arm Manipulation Robots
RoboTwin 2.0: Scalable Synthetic Data Generation And Benchmark Design For Dual-Arm Manipulation Robo ...
Enhanced LLM Code Generation With Property-based Testing! New Framework PGS To Break Self-Deception
Enhanced LLM Code Generation With Property-based Testing! New Framework PGS To Break Self-Deception
Evolution Of Llama To Support Reinforcement Learning, OctoThinker Shows The Power Of Intermediate Learning
Evolution Of Llama To Support Reinforcement Learning, OctoThinker Shows The Power Of Intermediate Le ...
Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge
Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge
FedNano: Lightweight And Efficient Distributed Learning Of Large-scale Multimodal Models
FedNano: Lightweight And Efficient Distributed Learning Of Large-scale Multimodal Models
ImmerseGen: Agent-guided, Lightweight X Highly Realistic Next-generation VR Scene Generation
ImmerseGen: Agent-guided, Lightweight X Highly Realistic Next-generation VR Scene Generation
PictSure: A New Method To Challenge Few-Shot Classification With The Power Of Visual Embedding
PictSure: A New Method To Challenge Few-Shot Classification With The Power Of Visual Embedding
Semantics-Oriented Reward Design With "PrefBERT," A New Evaluation Method To Evolve Long Sentence Generation
Semantics-Oriented Reward Design With "PrefBERT," A New Evaluation Method To Evolve Long Sentence Ge ...
Insight-V: A New Strategy For Multimodal Reasoning Connecting Vision And Thought
Insight-V: A New Strategy For Multimodal Reasoning Connecting Vision And Thought
Stable Flow: Visualization Of The "really Important Layers" Behind Image Generation
Stable Flow: Visualization Of The "really Important Layers" Behind Image Generation
Construction And Analysis Of The "TruthEval" Dataset To Expose LLM Weaknesses
Construction And Analysis Of The "TruthEval" Dataset To Expose LLM Weaknesses
Large Language Models
MaskDiT: Low Learning Cost Diffusion Model For Image Generation
MaskDiT: Low Learning Cost Diffusion Model For Image Generation
Image Generation
Limitations And Possibilities Of Large-Scale Language Models In Vietnamese High School Chemistry Exam Questions
Limitations And Possibilities Of Large-Scale Language Models In Vietnamese High School Chemistry Exa ...
Large Language Models
Giving LLMs A Whiteboard To Write Down Their Reasoning Process Greatly Improves Their Visual Reasoning Ability!
Giving LLMs A Whiteboard To Write Down Their Reasoning Process Greatly Improves Their Visual Reasoni ...
Prompting Method
MicroDiffusion: A Thousand-dollar Generative Image Quality Model That Outperforms Multi-million-dollar Models
MicroDiffusion: A Thousand-dollar Generative Image Quality Model That Outperforms Multi-million-doll ...
Image Generation