AI Native Daily Paper Digest – 20250402
1. Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation Keywords: Any2Caption, Video Generation, Multimodal Large Language Models, Any2CapIns…
AI Native Daily Paper Digest – 20250401
1. TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Keywords: Complex Visual Text Generation, TextCrafter, Multi-Visual Text Rendering,…
AI Native Daily Paper Digest – 20250331
1. AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Keywords: Large Language Models, domain adaptation, vocabulary…
AI Native Daily Paper Digest – 20250328
1. Video-R1: Reinforcing Video Reasoning in MLLMs Keywords: Video Reasoning, T-GRPO Algorithm, Multi-Modal Large Language Models, Temporal Modeling, Video-R1…
AI Native Daily Paper Digest – 20250327
1. Qwen2.5-Omni Technical Report Keywords: Multimodal model, End-to-end, Thinker-Talker architecture, TMRoPE, Streaming Category: Multi-Modal Learning Research Objective:…
AI Native Daily Paper Digest – 20250324
1. When Less is Enough: Adaptive Token Reduction for Efficient Image Representation Keywords: Collection, Knowledge Representation, AI Systems…
AI Native Daily Paper Digest – 20250320
1. ϕ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation Keywords: Collection Category: Knowledge Representation and Reasoning…
AI Native Daily Paper Digest – 20250319
1. RWKV-7 “Goose” with Expressive Dynamic State Evolution Keywords: Collection Category: Knowledge Representation and Reasoning Research Objective:…
AI Native Daily Paper Digest – 20250317
1. ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Collection 1. Multi-Modal Learning 2. Generative Models 3. Reinforcement Learning 4.…