Index of /public/video/video/

directories: 0, files: 735


NameLast modifiedSize
../
Groma: Revolutionizing Multimodal LLMs with Localized Visual Tokenization.mp42024-06-08 15:55:5444.72 MiB
InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds.mp42024-06-08 15:31:3434.81 MiB
VQ-Diffusion: Next-Gen Text-to-Image Synthesis.mp42024-06-08 15:28:0733.04 MiB
YOLO-World: Breakthrough in Real-Time Object Detection Across Any Vocabulary!.mp42024-06-08 15:44:3536.58 MiB
Breaking Boundaries: Advanced Language Modeling with In-Context Pretraining.mp42024-06-08 15:48:4139.07 MiB
GPT-4V(ision) Unveiled: Capabilities and Limitations Explored!.mp42024-06-08 16:01:0952.12 MiB
Gaussian Surfels: A 3D Modeling Revolution.mp42024-06-08 15:27:0135.97 MiB
CM3Leon: The Multi-Modal Marvel in AI Generation!.mp42024-06-08 15:55:5737.93 MiB
Reinforced Self-Training: A Game Changer for Language Models.mp42024-06-08 15:31:2338.87 MiB
Recurrent Memory Transformer: Revolutionizing Long-Term Dependencies.mp42024-06-08 15:49:1546.48 MiB
Unveiling Instruct-Imagen: Multi-modal Image Generation Revolution.mp42024-06-08 16:01:0446.51 MiB
Nemotron-4 15B: Exploring the Power of a Cutting-Edge Multilingual Model.mp42024-06-08 15:55:2438.93 MiB
Training Gopher: Insights from Scaling Language Models.mp42024-06-08 15:33:0035.32 MiB
Turbocharging Video Quality: T2V-Turbo Explained.mp42024-06-08 15:44:4638.80 MiB
MoAI: Revolutionizing AI with a Fusion of Vision and Language Models.mp42024-06-08 15:31:3945.03 MiB
OmniFusion: Revolutionizing Multimodal AI with Text and Image Integration.mp42024-06-08 15:44:1746.63 MiB
Grandmaster Chess Moves: A Search-Free Transformation.mp42024-06-08 15:52:3939.54 MiB
OctoPack: Revolutionizing Code LLMs with Git Commit Instructions.mp42024-06-08 15:55:2732.44 MiB
Revolutionizing Audio: Unveiling HiFi-Codec's High-Fidelity Compression.mp42024-06-08 15:34:0440.31 MiB
Mastering Neural Network Compression: Pruning & Quantization Simplified!.mp42024-06-08 15:52:3646.07 MiB
Chronos: Mastering Time Series with Language Models.mp42024-06-08 15:37:2538.91 MiB
ETSformer: The Future of Time-series Forecasting!.mp42024-06-08 15:32:2145.04 MiB
Revolutionizing AI Costs: Beyond Chinchilla-Optimal Scaling for Language Models!.mp42024-06-08 15:30:2231.26 MiB
Efficient Transfer Learning Made Easy with Adapters: A Unified Library Explained.mp42024-06-08 15:46:5039.52 MiB
Unlocking Better Generalization: AdaBound & AMSBound Explained.mp42024-06-08 15:34:4734.31 MiB
BitNet: Energy-Efficient 1-bit Transformers for Large Language Models!.mp42024-06-08 16:05:1545.43 MiB
MiniGPT4-Video: Revolutionizing Video Understanding with Multimodal AI!.mp42024-06-08 15:37:5134.00 MiB
Revolutionizing Multimodal Models with Matryoshka Structure!.mp42024-06-08 15:53:3641.12 MiB
Simplifying 3D Vision: Discover the DUSt3R Breakthrough.mp42024-06-08 15:31:4236.79 MiB
Boosting Text-to-Image Models: Mastering Spatial Consistency!.mp42024-06-08 15:59:2240.62 MiB
FairFace: Unveiling the New Standard for Balanced Face Recognition.mp42024-06-08 16:04:0343.25 MiB
ConceptLab: Creating Imaginative Concepts Like Never Before!.mp42024-06-08 16:02:1738.21 MiB
Revolutionizing LLMs: How System 2 Attention Enhances Accuracy and Objectivity!.mp42024-06-08 15:59:4936.90 MiB
Revolutionizing Motor Control: GEM Toolbox for RL Agents.mp42024-06-08 15:49:4946.02 MiB
PokéLLMon: The AI That Battles Like a Pro Pokémon Trainer!.mp42024-06-08 15:46:1945.00 MiB
Kosmos-2: Bridging Text and Vision with Grounded AI.mp42024-06-08 15:41:5238.08 MiB
Revolutionizing Multimodal Models with CosMo: A Deep Dive!.mp42024-06-08 16:00:4348.34 MiB
Cracking the Code: Summarizing Books with AI & Human Feedback.mp42024-06-08 16:03:4539.45 MiB
Predicting the Future: Humanoid Locomotion with Next Token Models.mp42024-06-08 15:58:0639.08 MiB
Revolutionary Layer Pruning: Are Deeper Layers Overrated?.mp42024-06-08 15:42:2541.41 MiB
Simplifying Music Creation: Unveiling MusicGen from Facebook Research.mp42024-06-08 15:23:3540.57 MiB
Mastering Multi-Subject Text-to-Image Generation with Bounded Attention!.mp42024-06-08 15:36:2736.22 MiB
SnapKV: Transforming LLM Efficiency with Intelligent KV Cache Compression!.mp42024-06-08 16:07:0535.67 MiB
How Prometheus Rivals GPT-4 in Evaluating AI Responses!.mp42024-06-08 15:57:1943.57 MiB
How a Single Image Can Compromise AI Safety: The Power of Visual Adversarial Attacks.mp42024-06-08 15:24:3243.54 MiB
MetRag: Revolutionizing Retrieval-Augmented Generation with Multi-layered Thoughts.mp42024-06-08 15:45:0137.85 MiB
Mastering Image Recognition with Deep Residual Learning (ResNets).mp42024-06-08 15:43:2746.31 MiB
Compromising LLM-Integrated Apps with Indirect Prompt Injection: Explained!.mp42024-06-08 16:07:5347.04 MiB
Transformers Unmasked: Their Strengths and Limitations.mp42024-06-08 15:33:2837.58 MiB
How Learning from Mistakes Supercharges AI Reasoning.mp42024-06-08 15:28:2230.50 MiB
Vision and Text: The Future of Deep Learning Explained!.mp42024-06-08 15:26:2337.41 MiB
Making AI Models Mobile: Imp's Breakthrough!.mp42024-06-08 15:27:5535.70 MiB
Extend Large Language Models Without Fine-Tuning: Introducing SelfExtend!.mp42024-06-08 15:28:1946.51 MiB
SuperPoint: Game-Changing Self-Supervised Interest Point Detector.mp42024-06-08 15:27:0836.02 MiB
Bridging the Gap: Can Large Language Models Achieve Theory-of-Mind?.mp42024-06-08 16:06:5739.61 MiB
AutoCoder: Beating GPT-4 in Code Generation! .mp42024-06-08 15:29:5844.46 MiB
CoAtNet: The Ultimate Hybrid of Convolution and Attention in Neural Networks! .mp42024-06-08 15:59:3346.27 MiB
Boosting Vision Transformers: How Register Tokens Enhance Performance!.mp42024-06-08 15:33:4042.68 MiB
Speeding Up Audio Creation: Dive into MAGNeT's Non-Autoregressive Transformer!.mp42024-06-08 15:48:2036.39 MiB
Revolutionizing Image Reconstruction Using fMRI Data: The MindEye2 Approach.mp42024-06-08 15:47:4241.02 MiB
Goodbye Attention? Meet the LambdaNetworks Revolution!.mp42024-06-08 15:50:3837.01 MiB
AlignProp: Revolutionizing Text-to-Image Models with Reward Backpropagation .mp42024-06-08 15:30:3741.26 MiB
VMamba: Revolutionizing Visual Representation with Linear Complexity.mp42024-06-08 15:29:5443.27 MiB
Revolutionary Vision Model: MLP-Mixer Unveiled.mp42024-06-08 16:05:2640.76 MiB
Revolutionizing 3D: Single Image to High-Quality 3D Models with Compress3D!.mp42024-06-08 15:34:1241.04 MiB
MoleculeSTM: Bridging Chemical Structures and Text for Superior Drug Discovery .mp42024-06-08 15:38:3245.21 MiB
Your Transformer Might Be Linear! | Deep Dive.mp42024-06-08 15:40:3747.22 MiB
What Matters Most in Vision-Language Models?.mp42024-06-08 15:40:2644.16 MiB
New Method Beats GPT-4 in Machine Translation: Introducing Contrastive Preference Optimization.mp42024-06-08 15:47:2238.84 MiB
Enhancing Digital Agents with Autonomous Evaluation Techniques.mp42024-06-08 15:43:2343.28 MiB
Achieve Ultra-Realistic Image Restoration with SUPIR: The Future of Photo-Enhancement .mp42024-06-08 16:03:1153.13 MiB
Aya 23: The Next Leap in Multilingual Language Models.mp42024-06-08 15:43:1243.85 MiB
The Power of Scale: Efficient Prompt Tuning Explained!.mp42024-06-08 15:52:1441.35 MiB
Lumiere's Breakthrough: Space-Time Diffusion for Stunning Video Generation.mp42024-06-08 15:39:1134.77 MiB
Doubly Efficient RL with Dropout Q-Functions: Meet DroQ!.mp42024-06-08 15:52:0138.50 MiB
Revolutionizing Real-Time Chatbots with Reinforcement Learning!.mp42024-06-08 15:49:4434.69 MiB
Accelerating Diffusion Training: The Min-SNR Weighting Strategy.mp42024-06-08 15:46:2731.41 MiB
DocLLM: Revolutionizing Document Understanding with Layout-Aware AI.mp42024-06-08 15:37:4836.76 MiB
Boosting Efficiency: Cobra's Leap in Multi-Modal AI Inference.mp42024-06-08 15:25:5036.53 MiB
PDFTriage: Revolutionizing Question Answering in Long, Structured Documents.mp42024-06-08 15:23:4132.54 MiB
PIVOT: Game-Changing Visual Prompts for Zero-Shot Robotics!.mp42024-06-08 15:45:2246.95 MiB
Unveiling FILIP: A Leap in Fine-grained Vision-Language Pre-Training.mp42024-06-08 16:00:3534.08 MiB
Uncovering Hidden Depths: Can Large Language Models Perform Multi-Hop Reasoning?.mp42024-06-08 15:33:5240.64 MiB
Revolutionizing Image Generation with Denoising Diffusion Models!.mp42024-06-08 16:00:1433.77 MiB
MagicAnimate: Revolutionizing Human Image Animation with Superior Temporal Consistency.mp42024-06-08 16:06:2142.10 MiB
How AI Learns to Use Tools: A Deep Dive into Toolformer!.mp42024-06-08 16:05:4836.59 MiB
SLiMe: Revolutionary One-Shot Image Segmentation with Stable Diffusion.mp42024-06-08 15:44:4053.72 MiB
Fine-Tuning Language Models: A Breakthrough in Reducing AI Hallucinations.mp42024-06-08 15:55:3434.95 MiB
Revolutionizing AI: FlexGen's Single GPU Power.mp42024-06-08 15:45:5337.25 MiB
Achieving Zero Bubbles in Pipeline Parallelism: A Deep Dive into Revolutionary Scheduling.mp42024-06-08 16:05:1140.07 MiB
OpenFlamingo: Open-Source Breakthrough in Vision-Language Models Training!.mp42024-06-08 15:54:2147.60 MiB
Revolutionizing Multimodal Learning: Meet CogVLM!.mp42024-06-08 15:24:3933.68 MiB
CogAgent: Revolutionizing GUI Interaction with Visual Language Models!.mp42024-06-08 15:30:1137.11 MiB
Building Better Language Agents: Lumos and Open-Source LLMs.mp42024-06-08 15:34:3747.62 MiB
Jamba: Revolutionizing Language Models with a Hybrid Transformer Approach.mp42024-06-08 15:53:4642.01 MiB
Unified Vision: How GiT's Universal Language Interface Revolutionizes Visual Models.mp42024-06-08 16:01:5437.70 MiB
Controlled Image Generation Without Re-training! Discover MultiDiffusion.mp42024-06-08 15:52:5642.89 MiB
Exploring OpenMoE: Breakthroughs in Mixture-of-Experts Language Models.mp42024-06-08 15:28:0449.88 MiB
LIMA: How Less Data Creates More Powerful AI Alignment!.mp42024-06-08 15:27:1938.91 MiB
Transforming a Single Image into 3D with AGG: A Revolutionary Approach .mp42024-06-08 15:30:1542.16 MiB
BiLLM: Supercharge LLMs with 1-Bit Quantization! 🚀.mp42024-06-08 15:33:5640.47 MiB
Unlocking Protein Secrets: Iterative SE(3)-Transformers Explained.mp42024-06-08 15:57:4440.69 MiB
SpatialVLM: Transforming Vision-Language Models with Spatial Reasoning!.mp42024-06-08 15:28:4846.03 MiB
Overcoming Challenges in Reinforcement Learning from Human Feedback (RLHF).mp42024-06-08 15:56:2743.05 MiB
Revolutionizing AI with AIOS: The Future of LLM Agents.mp42024-06-08 16:02:3450.75 MiB
Aligning AI Art: Diffusion-DPO Explained!.mp42024-06-08 15:51:2050.35 MiB
ObjectDrop: Revolutionizing Photorealistic Object Editing with Counterfactual Supervision.mp42024-06-08 16:07:2341.63 MiB
How Tool Documentation Revolutionizes AI Learning: New Breakthrough!.mp42024-06-08 15:50:4935.18 MiB
LENS: The Future of Computer Vision with Language Models!.mp42024-06-08 15:57:3141.17 MiB
NovoGrad: The Next-Gen Optimizer for Deep Learning.mp42024-06-08 15:41:2634.46 MiB
Agents: Unlocking User-Friendly Autonomous Language Agents!.mp42024-06-08 15:49:5455.41 MiB
Revolutionizing Image Segmentation: Unveiling MobileSAMv2.mp42024-06-08 16:01:2342.37 MiB
FaceStudio: Blend Your Face with Art in Seconds!.mp42024-06-08 15:41:1556.84 MiB
Muse: A Quantum Leap in Text-to-Image Generation with Transformers.mp42024-06-08 15:54:0148.69 MiB
Dynamic 4D Content Creation with GaussianFlow!.mp42024-06-08 15:25:2134.84 MiB
Meteor: Efficient Insights with Mamba Architecture.mp42024-06-08 15:24:0645.80 MiB
Creating Lifelike Avatars: From Audio to Photorealistic Conversations.mp42024-06-08 16:05:4143.79 MiB
DeepSpeed-Chat: Revolutionizing AI Training for Everyone!.mp42024-06-08 16:04:0035.59 MiB
Unlocking Spatial Positional Encoding with Learnable Fourier Features.mp42024-06-08 16:00:3833.29 MiB
MemGPT: Transforming AI Memory with OS-inspired Techniques.mp42024-06-08 15:44:0944.70 MiB
Efficient 3D Models with CompGS: Reducing Storage with Vector Quantization.mp42024-06-08 15:46:2454.26 MiB
Real-time Human Motion Generation: Exploring MotionLCM!.mp42024-06-08 15:24:2846.49 MiB
Transforming AI: How LongNet Handles A Billion Tokens Effortlessly!.mp42024-06-08 15:25:3245.33 MiB
Revolutionizing Attention Mechanisms with Kronecker Operators!.mp42024-06-08 15:21:3836.82 MiB
Adaptive Robots Tackling Everyday Tasks: A Breakthrough.mp42024-06-08 15:54:1743.29 MiB
How Many-Shot Learning Transforms Multimodal Models.mp42024-06-08 15:32:1641.67 MiB
Boosting AI Accuracy: Unveiling Self-RAG for Reliable Responses.mp42024-06-08 15:29:0048.59 MiB
Turbocharged Vision: Meet CatLIP, the New Champion in Image-Text Pre-training!.mp42024-06-08 15:22:3539.30 MiB
CLIP: Revolutionizing Vision with Language-Based Learning!.mp42024-06-08 15:54:1351.48 MiB
Mastering Text-Rich Images: Discover mPLUG-DocOwl 1.5's OCR-Free Revolution!.mp42024-06-08 15:50:0541.59 MiB
Extend Your Context with LongLoRA: Next-Level Large Language Models.mp42024-06-08 15:56:5644.50 MiB
Bayesian Flow Networks: A New Era in Generative Modeling .mp42024-06-08 15:45:1537.39 MiB
Unlocking the Power of Neural Discrete Representation Learning (VQ-VAE).mp42024-06-08 15:59:0035.08 MiB
Revolutionizing AI: SeaLLMs Tailored for Southeast Asia.mp42024-06-08 16:02:5645.65 MiB
VFusion3D: Revolutionizing 3D Model Generation with Video Diffusion.mp42024-06-08 15:56:1244.78 MiB
Data-Free Model Compression: Cutting-Edge Techniques Unveiled.mp42024-06-08 15:26:1043.08 MiB
Revolutionizing Transfer Learning: Meet Conditional Adapters (CoDA).mp42024-06-08 16:01:3943.40 MiB
Meet Dobb·E: The Future of Household Robots.mp42024-06-08 15:40:4254.74 MiB
Octo: Revolutionizing Robotic Learning with Generalist Policies.mp42024-06-08 15:58:5743.57 MiB
Mastering AI Instructions: Early Stopping for Better Tuning .mp42024-06-08 16:01:3239.71 MiB
Revolutionizing 3D Rendering: Faster and Efficient View Synthesis with Point-Based Radiance Fields.mp42024-06-08 15:41:2046.68 MiB
The Innovation of 2-Stage Backpropagation: Faster DNN Training!.mp42024-06-08 15:42:3241.27 MiB
Elevate Your AI: Turning Weak Models into Winners with Self-Play Fine-Tuning!.mp42024-06-08 15:58:3839.79 MiB
MambaByte: Revolutionizing Token-Free Language Models.mp42024-06-08 15:26:0232.26 MiB
Unveiling RLHF in Large Language Models: The Power of PPO .mp42024-06-08 16:05:0832.70 MiB
Revolutionizing Fine-Tuning: Unveiling MoRA's High-Rank Updates.mp42024-06-08 15:23:5833.04 MiB
Teaching Robots Soccer: Vision-Based Deep Reinforcement Learning Secrets!.mp42024-06-08 15:39:3845.72 MiB
Revolutionizing Image Editing: Adding Objects by Removing Them First!.mp42024-06-08 15:23:0141.64 MiB
How LoRAShear Revolutionizes Large Language Models: Efficient Pruning & Knowledge Recovery.mp42024-06-08 15:59:2937.72 MiB
MegaByte: Revolutionizing Sequence Prediction with Multiscale Transformers.mp42024-06-08 15:52:3138.24 MiB
Efficient Vision-Language Instruction Tuning: The MMA Approach.mp42024-06-08 15:55:1639.66 MiB
How JetMoE-8B Became a LLM Powerhouse on a Budget!.mp42024-06-08 15:33:3240.40 MiB
Achieving High-Fidelity Image Generation with Minimal Labels.mp42024-06-08 16:04:4942.93 MiB
Revolutionizing Text-to-3D in Seconds: The Instant3D Breakthrough!.mp42024-06-08 15:32:0842.71 MiB
Revolutionary Multi-Task AI: The Interactive Agent Foundation Model Explained.mp42024-06-08 16:03:5638.61 MiB
How Smart Prompts Supercharge LLM Recommendations! 🔥📚.mp42024-06-08 15:41:4543.36 MiB
Revolutionizing Code: Large Language Models for Compiler Optimization.mp42024-06-08 15:41:3043.21 MiB
The Limits of Transformers: Why Simple Tasks Trip Them Up.mp42024-06-08 15:34:5535.93 MiB
Unifying Multimodal Inputs: The Breakthrough of AnyMAL Language Model!.mp42024-06-08 15:53:0945.32 MiB
Bridging The Sim2Real Gap: The Power of Natural Language.mp42024-06-08 15:38:3834.54 MiB
Boosting Vision AI: Tackling Noise in Vision Transformers.mp42024-06-08 16:07:1944.01 MiB
Are AI Brains Just Copycats? Examining Math Skills of Large Language Models .mp42024-06-08 15:53:2857.41 MiB
StableDrag: The New Standard in Point-based Image Editing.mp42024-06-08 15:39:5739.96 MiB
TrustLLM: Making AI Models Safer and More Reliable!.mp42024-06-08 15:28:1545.48 MiB
ZipLoRA: Mastering Style & Subject in Generative Models.mp42024-06-08 16:05:2934.91 MiB
FlexiDreamer: Transforming Single Images into 3D with Cutting-Edge FlexiCubes.mp42024-06-08 15:53:3241.90 MiB
MADLAD-400: Revolutionizing Multilingual NLP with a 419-Language Dataset .mp42024-06-08 15:50:5338.11 MiB
Eureka Paradigm: Human-Level Rewards for Advanced Robotics with LLMs.mp42024-06-08 15:29:2047.24 MiB
How the Transformer Model Revolutionized AI.mp42024-06-08 16:04:1942.22 MiB
DreamDiffusion: Visualizing Thoughts with EEG Signals! .mp42024-06-08 15:58:5346.46 MiB
Revolutionizing Text-to-Image: Discover SPIN-Diffusion's Self-Play Magic!.mp42024-06-08 16:07:0837.46 MiB
Revolutionizing Video Creation: 4D-Guided Generative Rendering.mp42024-06-08 15:58:1441.18 MiB
From Images to Videos: PLLaVA's Breakthrough in Video Dense Captioning .mp42024-06-08 15:37:5435.42 MiB
Accelerating Diffusion Models: New Pseudo Numerical Methods!.mp42024-06-08 15:23:4732.31 MiB
Breakthrough in Video Synthesis: MAGVIT Unveiled!.mp42024-06-08 15:48:4842.04 MiB
Revolutionizing Vision: RMT Meets Vision Transformers.mp42024-06-08 15:22:1045.04 MiB
Making CLIP Practical: Data, Architecture, and Training Strategies Explored!.mp42024-06-08 15:46:1239.20 MiB
Scaling GANs: Meet GigaGAN, the New Text-to-Image Contender.mp42024-06-08 15:59:1537.65 MiB
Three Key Insights to Optimize Vision Transformers!.mp42024-06-08 16:00:5026.82 MiB
Game-Changer for Large Language Models: SparQ Attention Explained.mp42024-06-08 15:38:1535.77 MiB
Revolutionizing Image Translation with Unified Latent Spaces!.mp42024-06-08 15:24:5142.49 MiB
Are Language Models More Than Linear? Exploring Multi-Dimensional Representations.mp42024-06-08 15:44:0053.67 MiB
MAP-Neo: Unveiling the Next Generation of Bilingual AI!.mp42024-06-08 15:38:1239.37 MiB
Vision Mamba: Transforming Visual Learning with Bidirectional State Space Models.mp42024-06-08 15:37:3238.70 MiB
Octopus: Revolutionizing Vision-Language Programming with Environmental Feedback.mp42024-06-08 15:49:0648.60 MiB
Pegasus-1: Revolutionizing Video Understanding with Multimodal Language Models .mp42024-06-08 15:38:5255.17 MiB
The Game-Changing Method in Machine Translation: BPE-Dropout Explained!.mp42024-06-08 15:51:4742.03 MiB
BERT: Transforming NLP with Deep Bidirectional Transformers .mp42024-06-08 15:31:5739.97 MiB
Rethinking Sharpness and Generalization in Neural Networks.mp42024-06-08 15:43:5541.98 MiB
Revolutionizing Protein Design: The Power of Textual Descriptions.mp42024-06-08 15:56:4842.58 MiB
TransNormerLLM: The Future of Faster & Smarter Large Language Models.mp42024-06-08 15:50:3146.49 MiB
Wanda: Revolutionizing Pruning for Large Language Models.mp42024-06-08 15:23:5134.47 MiB
GPT-3: The Giant Few-Shot Learner - Explained!.mp42024-06-08 15:59:4141.61 MiB
CroissantLLM: The Game-Changing Bilingual AI Model for English and French!.mp42024-06-08 15:27:3549.74 MiB
How Sparrow Improves Dialogue Agents with Human Feedback.mp42024-06-08 15:54:4756.18 MiB
Exploring Model-based Reinforcement Learning in 10 Minutes!.mp42024-06-08 16:04:5344.85 MiB
Is Reinforcement Learning the Future of NLP?.mp42024-06-08 16:05:4439.13 MiB
Genie: Creating Interactive Worlds from Unlabeled Videos.mp42024-06-08 15:53:1238.37 MiB
WebAgent's Leap in Web Automation: Planning, Context, and Code Mastery!.mp42024-06-08 15:46:1533.33 MiB
The Future of Audio Compression: Meet SoundStream.mp42024-06-08 15:58:4937.83 MiB
Lemur: Bridging Natural Language and Code for Advanced AI Agents!.mp42024-06-08 15:53:5634.09 MiB
MaxViT: Revolutionizing Vision Transformers with Multi-Axis Attention.mp42024-06-08 15:57:0431.91 MiB
QLoRA: Memory-Efficient Fine-tuning for Large Language Models.mp42024-06-08 15:53:2333.15 MiB
The Power of Invalid Action Masking in Policy Gradient Algorithms.mp42024-06-08 15:40:0943.53 MiB
Unlimiformer: Transforming Long-Range Input Handling in Transformers.mp42024-06-08 15:41:0741.97 MiB
Uniting Giants: How SAM-CLIP Redefines Vision Models!.mp42024-06-08 15:39:0740.51 MiB
Mastering Model Quantization: The Power of QuIP in 2 Bits!.mp42024-06-08 16:06:2833.09 MiB
Breaking New Ground: Spacetime Gaussian Feature Splatting for Dynamic Views.mp42024-06-08 15:39:3039.79 MiB
VisionLLaMA: Revolutionizing Vision Tasks with Language Models.mp42024-06-08 15:51:2448.39 MiB
Revolutionizing Humanoid Robots: The ExBody Approach.mp42024-06-08 15:29:2741.02 MiB
FusionFrames: Revolutionizing Text-to-Video with Efficient Pipelines.mp42024-06-08 15:50:1346.14 MiB
Unlocking Long-Context Superpowers in Language Models.mp42024-06-08 15:42:5338.79 MiB
Revolutionizing Diffusion Models with Classifier-Free Guidance.mp42024-06-08 15:38:4243.73 MiB
Unlocking Vision-Language Magic: The Secret of BLIP-2!.mp42024-06-08 15:40:5535.18 MiB
FrugalGPT: Cut Costs and Boost Performance with LLMs .mp42024-06-08 15:49:3750.30 MiB
Voicebox: Revolutionizing Multilingual Speech Generation at Scale.mp42024-06-08 15:40:0549.18 MiB
The Future of AI: Chameleon’s Breakthrough in Multimodal Models.mp42024-06-08 15:42:0635.16 MiB
Revolutionary 3D Object Synthesis with MVEdit!.mp42024-06-08 15:27:5944.59 MiB
Meet OS-Copilot: The Future of Generalist AI Agents.mp42024-06-08 15:53:2046.77 MiB
LaVie: Revolutionizing Video Generation with AI-Powered Models.mp42024-06-08 15:33:0444.36 MiB
Revolutionizing Text Embeddings with Synthetic Data from LLMs.mp42024-06-08 15:28:2537.37 MiB
Unlocking ChatGPT’s Potential: How RAFT Transforms Domain-Specific Q&A.mp42024-06-08 15:29:2435.06 MiB
AdaRound: Revolutionizing Post-Training Quantization.mp42024-06-08 15:58:4130.14 MiB
The Future of Image Generation: Inside Visual Autoregressive Modeling (VAR).mp42024-06-08 15:32:0440.37 MiB
Streamlining Transformers: The Power of LayerDrop Explained.mp42024-06-08 15:25:5947.53 MiB
Revolutionizing High-Resolution Depth Estimation with PatchFusion.mp42024-06-08 15:26:4834.48 MiB
Discover OBELICS: The Ultimate Open-Source Multimodal Dataset.mp42024-06-08 16:07:2632.41 MiB
Revolutionizing NLP: Linear Transformers with Learnable Kernels!.mp42024-06-08 15:39:4140.89 MiB
Are Multiple Attention Heads Overrated? 🚀 AI Model Insights!.mp42024-06-08 15:54:3233.40 MiB
Enhancing LLM Training with Neurally Compressed Text: A New Approach!.mp42024-06-08 15:32:4335.40 MiB
Cutting-Edge Photorealistic Text-to-Image Models Explained.mp42024-06-08 15:22:4845.06 MiB
Say Goodbye to RL: Contrastive Preference Learning Explained!.mp42024-06-08 15:56:0140.40 MiB
Dual PatchNorm: A Breakthrough in Vision Transformers.mp42024-06-08 15:55:4138.51 MiB
Game-Changing CNN Upgrade: SPD-Conv Explained!.mp42024-06-08 15:23:0840.44 MiB
Cross-Covariance Transformers: Breaking Barriers in Vision AI.mp42024-06-08 15:45:4932.10 MiB
Learning to Fly: Reinforcement Learning for Quadcopter Control.mp42024-06-08 15:55:5053.02 MiB
LoRA: Revolutionizing Fine-Tuning for Large Language Models.mp42024-06-08 16:07:3835.13 MiB
Is Data Compression the Key to Artificial Intelligence? 🤖📚.mp42024-06-08 16:08:0446.36 MiB
Harnessing Diffusion Models for Superior Neural Network Parameters.mp42024-06-08 15:44:5752.46 MiB
Customizing Images in Seconds: Dive into PhotoVerse's Magic!.mp42024-06-08 15:33:2242.14 MiB
Searchformer: Revolutionizing Planning with Transformers and Search Dynamics Bootstrapping.mp42024-06-08 16:01:0038.65 MiB
Fast and Accurate Model Scaling: The New Approach Revolutionizing CNNs.mp42024-06-08 15:36:2335.64 MiB
Accelerating Large Diffusion Models on Mobile GPUs with Speedy Optimizations.mp42024-06-08 15:23:0428.87 MiB
P-Tuning v2: Revolutionizing Prompt Tuning Across Scales and Tasks.mp42024-06-08 15:33:2533.98 MiB
Rapid 3D Creation from Text with GaussianDreamer!.mp42024-06-08 15:57:3544.87 MiB
Revamping ResNet-50: Modern Training Techniques Explored.mp42024-06-08 15:21:3442.27 MiB
SAM-ON: The Sharpness-Aware Minimization Revolution.mp42024-06-08 15:35:3134.18 MiB
How Watermarking Makes AI Models ‘Radioactive’ – Detecting Data Contamination!.mp42024-06-08 15:43:1534.79 MiB
Gemma: Open-Sourced AI Excellence Derived from Google's Gemini .mp42024-06-08 15:46:3036.48 MiB
RoFormer: Transforming Transformers with Rotary Positional Embeddings.mp42024-06-08 15:53:1635.55 MiB
Supercharge Your GPU: Fine-tune 100B Models with NVMe SSDs!.mp42024-06-08 15:40:4537.71 MiB
SuGaR: Revolutionizing 3D Mesh Extraction & Rendering!.mp42024-06-08 15:42:3746.15 MiB
FlashSpeech: Revolutionizing Fast, High-Quality Speech Synthesis with Zero-Shot Efficiency!.mp42024-06-08 15:25:3935.88 MiB
Unlocking Deep Learning: The Power of Rectified Adam (RAdam).mp42024-06-08 15:22:2641.37 MiB
Reconstructing 3D Scenes with AI: SceneScript Explained.mp42024-06-08 15:48:2775.53 MiB
Unlocking the Power of Simple Modifications in Multimodal Learning.mp42024-06-08 16:00:4748.28 MiB
Unleashing GPT-4V(ision): Revolutionizing Web Agents with Visual Grounding.mp42024-06-08 15:24:2033.00 MiB
JudgeLM: Revolutionizing AI Evaluation with Fine-Tuned Large Language Models .mp42024-06-08 15:44:2840.80 MiB
Decoupled Contrastive Learning: Boosting Efficiency in Self-Supervised Models.mp42024-06-08 15:42:1339.28 MiB
Revolutionizing Deeper Network Visuals with MACO.mp42024-06-08 15:34:3234.48 MiB
Revolutionizing Audio Creation: WavJourney Explained.mp42024-06-08 15:34:2954.37 MiB
Revolutionizing AI: Evolutionary Model Merging Explained!.mp42024-06-08 15:54:5546.48 MiB
Closing the Gap to GPT-4V: Introducing InternVL 1.5!.mp42024-06-08 15:57:5645.42 MiB
One-2-3-45++: Fast 2D Image to High-Fidelity 3D Object Transformation .mp42024-06-08 15:22:0035.29 MiB
More Agents, Better Results: Boosting LLMs Performance with Ensembles.mp42024-06-08 15:31:4641.21 MiB
LLM360: Revolutionizing AI with Fully Transparent LLMs - Amber & CrystalCoder Unveiled!.mp42024-06-08 15:26:5467.56 MiB
Revolutionizing Robotic Coordination with AutoRT!.mp42024-06-08 16:05:3343.52 MiB
DeepSeek LLM: Pioneering Long-Term Vision in Open-Source Language Models.mp42024-06-08 15:37:0643.96 MiB
DragonDiffusion: Revolutionizing Image Editing with Energy-Guided Precision!.mp42024-06-08 15:30:4148.36 MiB
AppAgent: Revolutionizing Smartphone Interaction with AI.mp42024-06-08 15:57:1145.51 MiB
ScreenAI: The Future of UI and Infographics Understanding Unveiled! .mp42024-06-08 15:55:1244.48 MiB
How DDIMs Revolutionize Image Generation: Faster & Efficient!.mp42024-06-08 15:51:3537.07 MiB
Maximizing Visual Data in MLLMs: The Power of Dense Connectors!.mp42024-06-08 15:32:2936.22 MiB
Revolutionizing Attention: Meet the Routing Transformer!.mp42024-06-08 15:24:1442.21 MiB
Unlocking Vision: Routers in Mixture of Experts Explored!.mp42024-06-08 16:06:0831.45 MiB
Boosting Large Language Models to Generate Longer Texts Efficiently!.mp42024-06-08 15:21:5235.12 MiB
ViTAR: Revolutionizing Vision Transformers for Any Resolution!.mp42024-06-08 15:45:4738.04 MiB
DreamTuner: Creating Photo-Realistic Images from a Single Reference Picture.mp42024-06-08 15:50:2343.81 MiB
Revolutionizing Control: The Power of Temporal Difference Learning.mp42024-06-08 15:33:4554.26 MiB
How VideoBooth Revolutionizes Video Creation with Image Prompts! .mp42024-06-08 15:54:3937.81 MiB
Unlocking Faster AI: How WRAP Transforms Language Models with Synthetic Data!.mp42024-06-08 15:40:1850.13 MiB
Decentralizing Large Language Models: Meet Petals.mp42024-06-08 15:38:2435.70 MiB
Revolutionizing Text Generation: The Power of Copy-Generator (CoG).mp42024-06-08 15:42:4942.02 MiB
Turbocharging Transformers: Unveiling Speculative Decoding for Faster Inference.mp42024-06-08 15:23:1841.79 MiB
Grounding DINO 1.5: Next-Gen Object Detection at the Edge.mp42024-06-08 15:39:2249.02 MiB
Unlocking the True Potential of Long-Context Language Models.mp42024-06-08 15:24:4349.31 MiB
Revolutionizing Robot Learning: Finetuning Offline Models for the Real World.mp42024-06-08 15:51:1537.31 MiB
CommonCanvas: Training AI with Creative Commons Images!.mp42024-06-08 15:39:5333.10 MiB
Say Goodbye to AI Hallucinations: How Chain-of-Verification Makes LLMs Smarter.mp42024-06-08 15:38:5529.74 MiB
Revolutionizing AI: Table-GPT Enhances Language Models for Complex Table Tasks!.mp42024-06-08 15:50:0146.19 MiB
EfficientNet: The Future of Model Scaling in Deep Learning.mp42024-06-08 15:21:4135.53 MiB
Unlocking the Full Potential of Diffusion U-Net: Meet FreeU!.mp42024-06-08 15:29:0444.77 MiB
Robotic Revolution in Livestock Farming: Insights from the SELF-AIR Project.mp42024-06-08 15:32:5732.72 MiB
Affordable Robotics: Mastering Precision without Breaking the Bank.mp42024-06-08 15:32:2552.37 MiB
Revolutionizing LLMs: Efficient Inference with Flash Memory.mp42024-06-08 15:47:5341.35 MiB
Unlocking LLM Efficiency: PagedAttention & vLLM Revolutionize Memory Management.mp42024-06-08 15:22:0632.33 MiB
Revolutionizing Image Generation: Meet the Flexible Vision Transformer (FiT).mp42024-06-08 16:04:3643.05 MiB
Unveiling PixArt-$\delta$: Lightning Fast, Precision-Controlled Image Generation!.mp42024-06-08 16:07:5638.63 MiB
ByteEdit: Revolutionizing Image Editing with Speed and Precision .mp42024-06-08 16:07:3561.64 MiB
Optimizing Neural Networks with G.pt: A Game-Changer in AI Training!.mp42024-06-08 16:05:5648.92 MiB
De-Diffusion: Transforming Images into Text for Multi-Modal AI.mp42024-06-08 15:40:2141.69 MiB
MobileDiffusion: Instant Text-to-Image on Your Phone!.mp42024-06-08 15:36:5835.86 MiB
Revolutionary LMDX: Extracting Information from Complex Documents Using AI.mp42024-06-08 15:37:3647.69 MiB
WebGPT: Enhancing Question-Answering with Human-Guiding Browsing!.mp42024-06-08 15:26:1930.48 MiB
Why Transformers Outshine State Space Models in Copying Tasks.mp42024-06-08 15:55:0840.14 MiB
Efficient LLMs: The Breakthrough of Structured Pruning.mp42024-06-08 16:02:4831.87 MiB
Continual Training Revolution: Enhance Your CLIP Models.mp42024-06-08 15:47:1055.51 MiB
OtterHD-8B: Revolutionizing High-Resolution Visual Perception.mp42024-06-08 15:35:5753.57 MiB
LightGlue: Fast & Efficient Local Feature Matching Revolution.mp42024-06-08 15:24:5432.23 MiB
Fortifying LLMs: A Deep Dive into Instruction Hierarchies for Enhanced Security.mp42024-06-08 15:27:4339.34 MiB
PaLM 2: Multilingual Mastery & Efficient Inference.mp42024-06-08 15:52:2132.48 MiB
How Effective Are Low-bit Quantized LLaMA3 Models? An Empirical Analysis.mp42024-06-08 15:38:0234.95 MiB
Unlocking Speed in High-Res Image Synthesis: A Dive into LADD!.mp42024-06-08 15:42:4550.33 MiB
Exploring Kolmogorov–Arnold Networks: The Future of Neural Architecture? .mp42024-06-08 15:51:5454.28 MiB
Modular LLMs: Reusing LoRAs for Adaptable AI Performance.mp42024-06-08 15:25:0537.68 MiB
Unlocking the Future of Language Models Through Retrieval Methods.mp42024-06-08 15:39:4541.54 MiB
LangSplat: Revolutionizing 3D Language Querying .mp42024-06-08 15:54:5140.94 MiB
How Cross-Layer Attention Reduces Transformer Memory Footprint.mp42024-06-08 15:25:2837.32 MiB
SmoothQuant: Efficient & Accurate Quantization for Massive Language Models.mp42024-06-08 16:02:5938.68 MiB
Revolutionizing Text-to-Video: Emu Video’s Two-Step Magic.mp42024-06-08 16:00:5632.12 MiB
InstaFlow: Game-Changing One-Step Text-to-Image Generation in 0.1 Seconds!.mp42024-06-08 15:42:5639.74 MiB
Unlocking Language Models: Direct Preference Optimization.mp42024-06-08 15:36:0139.41 MiB
Unlocking Object-Centric Learning with Slot Attention.mp42024-06-08 16:05:0547.55 MiB
Creating Your Dream Videos with DreamVideo.mp42024-06-08 16:01:2038.05 MiB
How AI Trains Itself: Inside Self-Rewarding Language Models .mp42024-06-08 15:44:3132.12 MiB
AutoWebGLM: Next-Gen AI for Web Navigation Explored!.mp42024-06-08 15:31:4937.44 MiB
Unlocking RLHF: The Power of OpenRLHF for Large Language Models.mp42024-06-08 15:56:4043.55 MiB
Enhancing AI: Making GPT-3 Follow Instructions with Human Feedback.mp42024-06-08 16:00:2536.50 MiB
Achieving Zero-Shot Text-to-Image Generation with Autoregressive Transformers.mp42024-06-08 15:36:3444.13 MiB
Achieving Superior Audio Generation: Unveiling Representation Similarity Regularization.mp42024-06-08 15:46:4035.71 MiB
Fine-Tuning Language Models with Human Feedback: A New Paradigm in NLP.mp42024-06-08 15:57:2741.82 MiB
X-Adapter: Universal Plugin Compatibility for New Diffusion Models Explained!.mp42024-06-08 15:27:5253.52 MiB
Are Emergent Abilities in AI Models Just a Metric Mirage?.mp42024-06-08 15:47:0545.18 MiB
Hawk & Griffin: Revolutionizing Language Models with Efficient Architecture.mp42024-06-08 15:52:1843.34 MiB
Depth Anything: Maximizing Monocular Depth Estimation with Unlabeled Data!.mp42024-06-08 15:55:3040.56 MiB
LLMs: The Surprising Time Series Wizards!.mp42024-06-08 15:52:0655.13 MiB
How Contrastive Decoding Boosts Reasoning in AI Models! 🚀.mp42024-06-08 16:04:4540.92 MiB
Mobile-Agent: Revolutionizing Mobile Devices with Visual Perception.mp42024-06-08 15:47:5747.36 MiB
Unlocking AI Limits: Reward Model Overoptimization Revealed!.mp42024-06-08 15:56:0828.90 MiB
Unlocking Visual Intelligence: The Power of Image World Models (IWM).mp42024-06-08 15:34:1933.61 MiB
Qwen-VL: Revolutionizing Vision-Language Models .mp42024-06-08 15:58:3535.67 MiB
FLIP: Revolutionizing Language-Image Pre-training with Masking!.mp42024-06-08 16:07:4834.16 MiB
Unlocking Complex Texts: RAPTOR’s Revolutionary Retrieval Approach.mp42024-06-08 15:37:0241.27 MiB
Solving Misalignment in Text-to-Image AI: CoMat Explained!.mp42024-06-08 15:29:5045.83 MiB
VideoAgent: Revolutionizing Long-Form Video Understanding with AI.mp42024-06-08 15:50:5742.01 MiB
The Lottery Ticket Hypothesis: Uncovering Trainable Sparse Neural Networks.mp42024-06-08 15:39:1847.12 MiB
Revolutionizing Large Language Models with Layer-Condensed KV Cache.mp42024-06-08 15:42:2933.16 MiB
Instant 3D Magic: Transform Any Single Image to 3D Mesh in 45 Seconds!.mp42024-06-08 15:34:1647.22 MiB
Advancing AI Reasoning: Meet Eurus LLMs and UltraInteract!.mp42024-06-08 15:37:4453.09 MiB
How V3D Revolutionizes 3D Generation with Video Diffusion Models.mp42024-06-08 16:00:2256.97 MiB
Can AI Really See Math? Exploring MathVerse and Multi-Modal Models.mp42024-06-08 15:51:4050.59 MiB
Breaking Language Barriers: PolyLM - The Open Source Polyglot LLM.mp42024-06-08 16:03:5343.14 MiB
I2VGen-XL: Breathing Life Into Static Images with Advanced Video Synthesis.mp42024-06-08 16:06:3651.34 MiB
Editing 3D Scenes with Text Instructions: Meet GaussianEditor.mp42024-06-08 15:45:2751.62 MiB
InstantID: Revolutionary Zero-Shot Image Personalization!.mp42024-06-08 15:58:2755.06 MiB
Boosting RL: The Power of Reusing Data Across Experiments!.mp42024-06-08 16:04:5639.42 MiB
wav2vec 2.0: Revolutionizing Speech Recognition with Self-Supervised Learning.mp42024-06-08 15:36:0542.53 MiB
MindAgent: How AI is Revolutionizing Gaming Collaboration.mp42024-06-08 15:50:2738.96 MiB
Unlocking New Levels of Language Modeling with OpenELM! 🧠✨.mp42024-06-08 15:25:0941.39 MiB
How PERL Revolutionizes Reinforcement Learning with Human Feedback .mp42024-06-08 16:04:2131.27 MiB
Solving Long-Sequence Challenges with Extrapolatable Transformers.mp42024-06-08 15:44:5330.76 MiB
Latent Consistency Models: Ultra-Fast, High-Resolution Image Synthesis.mp42024-06-08 15:43:4446.99 MiB
Can AI-Generated Data Rival Real Data? Discover SynCLR.mp42024-06-08 16:02:4534.11 MiB
How Large Language Models are Revolutionizing Optimization.mp42024-06-08 15:36:5143.19 MiB
DMV3D: Breakthrough in High-Fidelity 3D Reconstruction and Denoising.mp42024-06-08 15:25:1853.10 MiB
Training Language Models with Less Communication: DiLoCo Method.mp42024-06-08 15:25:0242.74 MiB
Winning the RLHF Game: Mastering Reward Modeling in AI.mp42024-06-08 15:39:5057.25 MiB
Unlocking 400K Token Contexts in LLMs with Activation Beacon!.mp42024-06-08 16:02:0246.58 MiB
From Single Image to 3D: The Magic of LRM .mp42024-06-08 15:27:0440.30 MiB
Kosmos-2.5: The Future of Multimodal Text & Image Understanding.mp42024-06-08 15:30:0539.64 MiB
Revolutionizing Depth Estimation with Diffusion Models: Meet Marigold!.mp42024-06-08 16:07:1534.65 MiB
YOLOv9: Revolutionizing Object Detection with Programmable Gradients.mp42024-06-08 15:45:1130.94 MiB
The Secret to Scaling Deep Reinforcement Learning: Mixtures of Experts.mp42024-06-08 15:59:2534.96 MiB
Octopus v4: Revolutionizing Language Models with Graph-Based AI.mp42024-06-08 15:26:4136.49 MiB
Rapid 3D Scene Generation with GRM: A Game Changer in Graphics!.mp42024-06-08 15:57:4059.96 MiB
Revolutionizing Traffic Forecasting: The Power of Spatial-Temporal Transformers.mp42024-06-08 16:06:2540.53 MiB
What BERT Focuses On: Unveiling Attention Patterns.mp42024-06-08 15:44:4335.50 MiB
Latent Quantization Breakthrough: Disentangled Representations Explored!.mp42024-06-08 15:50:1638.95 MiB
Orca 2: Enhancing Small Language Models' Reasoning Skills.mp42024-06-08 16:06:0550.18 MiB
Unlocking Complex Problem-Solving in AI: Skills-in-Context Prompting Explored.mp42024-06-08 15:31:0140.65 MiB
Transforming Object Detection: A Deep Dive into DETR by Facebook AI.mp42024-06-08 15:27:1543.71 MiB
COCONut: Revolutionizing COCO with Next-Gen Segmentation Annotations!.mp42024-06-08 16:03:2341.79 MiB
Wilbur: Revolutionizing Web Agents with Adaptive Learning.mp42024-06-08 15:33:3644.74 MiB
AnyGPT: Unifying Speech, Text, Images, and Music with Ease.mp42024-06-08 15:36:1634.44 MiB
DreamGaussian: Fast and High-Quality 3D Content Creation Unveiled!.mp42024-06-08 15:55:2045.83 MiB
NExT-GPT: The Future of Any-to-Any Multimodal AI!.mp42024-06-08 16:03:0632.46 MiB
Alpha-CLIP: Next-Gen Image Recognition with Precision Focus.mp42024-06-08 15:50:3542.17 MiB
Unlocking 3D Secrets in Latent Diffusion Models .mp42024-06-08 15:36:2040.77 MiB
Larimar: Revolutionizing Large Language Models with Brain-Inspired Memory Control.mp42024-06-08 15:55:0534.46 MiB
Vary-toy: Compact Vision Language Model Revolutionizing AI Research!.mp42024-06-08 15:26:1634.78 MiB
Compressing Trillion-Parameter Models: The QMoE Breakthrough Explained!.mp42024-06-08 15:49:2231.58 MiB
Unifying Transformers: Magneto's Marvel in AI.mp42024-06-08 15:53:5341.55 MiB
Speeding Up AI: Speculative Streaming for Fast LLM Inference.mp42024-06-08 15:59:3852.47 MiB
Breaking Limits: LongRoPE Extends LLM Context to Over 2 Million Tokens!.mp42024-06-08 15:41:1037.81 MiB
Revolutionizing Deep Learning with Global Context Networks.mp42024-06-08 15:53:4942.03 MiB
WaveCoder: Revolutionizing Code LLMs with CodeOcean Dataset!.mp42024-06-08 15:35:1139.97 MiB
How 26 Principles Supercharge Your AI: Boosting GPT-4 and LLaMA Performance.mp42024-06-08 15:29:3839.24 MiB
Mastering AI Code Generation: StepCoder's Revolutionary RL Approach.mp42024-06-08 15:48:5943.90 MiB
ResMLP: The Future of Image Classification?.mp42024-06-08 15:34:0152.38 MiB
Discovering SimCLR: A New Era in Contrastive Learning.mp42024-06-08 15:25:3535.98 MiB
Can AI Plan Your Next Vacation? Exploring TravelPlanner's Real-World Challenge!.mp42024-06-08 15:22:0337.21 MiB
Boosting AI with Self-Alignment and Instruction Backtranslation .mp42024-06-08 15:31:1941.63 MiB
How MARGE Is Revolutionizing Language Models Through Paraphrasing!.mp42024-06-08 16:01:1635.53 MiB
Can Transformers Thrive Without Attention? Exploring Feed-Forward Networks.mp42024-06-08 16:07:4235.68 MiB
Bridging Deep Learning & Symbolic AI: PrediNet Explained!.mp42024-06-08 15:22:4451.35 MiB
LLMs: The Future Tool Makers in AI.mp42024-06-08 15:59:0439.53 MiB
Practical Dataset Poisoning: A Deep Dive into Vulnerabilities.mp42024-06-08 15:56:1532.58 MiB
Extending AI's Memory: E2-LLM Breakthrough in Large Language Models .mp42024-06-08 15:33:1845.18 MiB
Red Teaming Language Models: Methods and Lessons Uncovered!.mp42024-06-08 15:39:2637.78 MiB
The Truth About 'Zero-Shot': Why More Data Always Wins!.mp42024-06-08 15:48:0945.47 MiB
Cracking the Code: How LLaMA is Revolutionizing Non-English AI.mp42024-06-08 15:26:2639.86 MiB
Efficient 3D GANs: A Leap in Quality and Consistency!.mp42024-06-08 16:03:3853.57 MiB
Boosting AI Reasoning: Unraveling Iterative RPO for Better Logic.mp42024-06-08 15:41:5540.98 MiB
Indus: Specialized and Efficient Language Models for Science.mp42024-06-08 15:47:4640.17 MiB
MobileLLM: Revolutionizing Efficient Language Models for Smartphones .mp42024-06-08 16:00:2944.31 MiB
Rho-1: Transforming Language Models with Selective Token Training.mp42024-06-08 15:31:3143.86 MiB
A Deeper Dive into diffGrad: Revolutionizing CNN Optimization.mp42024-06-08 15:43:1942.59 MiB
Unpacking MM1: The Future of Multimodal Large Language Models .mp42024-06-08 15:39:3341.07 MiB
Idempotent Generative Networks: Revolutionizing Single-Step Image Generation!.mp42024-06-08 15:41:5940.78 MiB
RWKV: The Future of Sequence Processing in AI.mp42024-06-08 15:47:2942.42 MiB
BYOL: Mastering Self-Supervised Learning Without Negative Pairs.mp42024-06-08 15:47:1442.41 MiB
Speed Up Diffusion Models with Progressive Distillation!.mp42024-06-08 15:32:4735.51 MiB
Mastering Reinforcement Learning with World Models.mp42024-06-08 16:00:0044.98 MiB
Unraveling the Transformer-in-Transformer Model.mp42024-06-08 16:00:5340.50 MiB
Exploring MiniGPT-5: Next-Gen Vision and Language Generation.mp42024-06-08 15:37:1131.56 MiB
Diffusion-GAN: A Breakthrough in Stable GAN Training.mp42024-06-08 16:03:1542.35 MiB
Real-Time Feedback Boosts Continual Learning in AI Instruction Agents!.mp42024-06-08 16:01:4744.86 MiB
PhotoMaker: Revolutionizing Custom Human Photos with AI Magic! .mp42024-06-08 15:47:2541.21 MiB
Revolutionizing Deep Learning: ReinMax vs. Straight-Through .mp42024-06-08 15:31:0434.99 MiB
Revolutionary Real-time Avatars: Perpetual Humanoid Control Explained.mp42024-06-08 15:38:2742.40 MiB
EfficientViT: The Future of Vision AI with Multi-Scale Linear Attention.mp42024-06-08 16:03:2053.31 MiB
Cracking the Code: DoRA’s Low-Rank Adaptation for Efficient Fine-Tuning.mp42024-06-08 15:35:4137.25 MiB
Revolutionizing Memory Networks: Meet MEMO.mp42024-06-08 15:26:1337.18 MiB
Pushing NLP Boundaries: The Power of T5's Unified Text-to-Text Transformer.mp42024-06-08 15:42:1037.43 MiB
Mastering Robots with Diffusion Policy: A Breakthrough in Visuomotor Learning.mp42024-06-08 15:48:1639.36 MiB
Revolutionizing Face Swapping with Face-Adapter!.mp42024-06-08 15:47:3450.66 MiB
Revolutionizing Image Generation: Inside the Paella Model.mp42024-06-08 15:27:2338.33 MiB
Unleashing FP8 Power: Efficiently Training Massive LLMs.mp42024-06-08 15:57:1545.74 MiB
Mastering EMA for Large-Scale Machine Learning .mp42024-06-08 15:33:4940.20 MiB
Stacking Transformers: Efficient Pre-Training for LLMs Explained.mp42024-06-08 15:23:3834.52 MiB
Beware: Hidden Traps in Pre-trained AI Models!.mp42024-06-08 15:34:0838.28 MiB
GaLore: Revolutionizing LLM Training with Memory-Efficient Gradient Projections.mp42024-06-08 16:05:5241.32 MiB
Revolutionize LLMs: BitNet b1.58 Brings 1.58-bit Efficiency!.mp42024-06-08 15:36:4744.36 MiB
DeepSeekMoE: Revolutionizing Expert Specialization in Language Models.mp42024-06-08 15:53:4264.30 MiB
AI Feedback vs Human Feedback: Revolutionizing Reinforcement Learning (RLAIF).mp42024-06-08 16:04:2536.75 MiB
Unlocking Advanced Reasoning: Chain of Code Explained .mp42024-06-08 15:35:2846.23 MiB
Transform Real-World Videos into Interactive Games with Video2Game!.mp42024-06-08 15:47:1848.69 MiB
Revolutionizing 3D Scenes: The Power of 2D Gaussian Splatting .mp42024-06-08 15:44:0551.58 MiB
Meet Med-Flamingo: Revolutionizing Medical AI with Few-Shot Learning!.mp42024-06-08 15:55:0141.40 MiB
Unleashing Hidden Power: How LLM2Vec Transforms Language Models into Text Encoders.mp42024-06-08 15:58:4544.96 MiB
Q-Instruct: Elevating Low-Level Visual Skills in AI Models.mp42024-06-08 15:38:5940.33 MiB
Boosting AI Reasoning: Contrastive Chain-of-Thought Explained!.mp42024-06-08 16:07:4540.20 MiB
MoE-LLaVA: Efficient Scaling of Vision-Language Models with Mixture of Experts.mp42024-06-08 16:02:2633.93 MiB
Ferret-UI: Revolutionary Mobile UI Interaction with Multimodal LLMs.mp42024-06-08 15:48:5234.27 MiB
Grokked Transformers: Secrets of Implicit Reasoning Unveiled.mp42024-06-08 15:46:0448.72 MiB
Simple Image Retrieval Beats Diffusion Models in Data Augmentation.mp42024-06-08 15:29:3437.99 MiB
Decoding AI: Transformer Programs into Python Code!.mp42024-06-08 15:49:2939.00 MiB
Revolutionizing 3D Meshes: Super Fast, High-Quality Reconstruction with MeshLRM!.mp42024-06-08 15:59:4646.88 MiB
VideoMamba Unleashed: Next-Gen State Space Model for Video Mastery.mp42024-06-08 15:54:2539.56 MiB
Generate Perfect Images Anywhere: Discover PACGen for Ultimate Control!.mp42024-06-08 15:46:3332.48 MiB
Distil-Whisper: Faster, Smaller, Yet Powerful Speech Recognition!.mp42024-06-08 15:48:4535.59 MiB
Real-Time Radiance Fields: How 3D Gaussian Splatting is Changing the Game.mp42024-06-08 15:27:3134.87 MiB
Scaling Down AI: Breakthrough in Efficient Stable Diffusion Models!.mp42024-06-08 16:06:1132.63 MiB
HyperDiffusion: Generating Stunning 3D and 4D Shapes with Neural Fields.mp42024-06-08 15:37:1836.84 MiB
How LLaMA Pro Revolutionizes AI with Block Expansion.mp42024-06-08 16:02:2335.94 MiB
Revolutionizing AI: Faster & Smarter Language Models with Multi-Token Prediction.mp42024-06-08 15:27:2752.33 MiB
Enhancing Code Generation: AlphaCodium’s Multi-Stage Approach Explained.mp42024-06-08 15:50:4240.55 MiB
Revolutionizing 3D Reconstruction: Gamba's Innovative Techniques Explained!.mp42024-06-08 15:30:3045.63 MiB
Unlocking Unlimited Sequence Lengths: Introducing Lightning Attention-2!.mp42024-06-08 16:04:2834.76 MiB
Vision Transformers vs. ResNets: New Insights with SAM.mp42024-06-08 15:36:5540.99 MiB
OK-Robot: Merging Vision-Language Models with Robotics for Home Automation.mp42024-06-08 15:48:3444.74 MiB
How Step-by-Step Verification Boosts AI Reasoning! .mp42024-06-08 15:40:4834.60 MiB
The Perceiver: Revolutionizing Multi-Modal Deep Learning!.mp42024-06-08 15:22:1841.61 MiB
How Diffusion Models Revolutionize Atari Game AI.mp42024-06-08 15:32:4039.09 MiB
DeepSeek-VL: Revolutionizing Real-World Vision-Language Understanding.mp42024-06-08 15:45:0540.55 MiB
Bridging the Gap: Objective Mismatch in Reinforcement Learning.mp42024-06-08 15:43:3138.80 MiB
Byte Models: Simulating the Digital World with bGPT.mp42024-06-08 16:04:3248.34 MiB
StreamMultiDiffusion: Real-Time Image Generation with Semantic Control.mp42024-06-08 15:32:5148.51 MiB
Scaling Vision Transformers to New Heights: ViT-22B Explored.mp42024-06-08 15:54:3533.09 MiB
Unlocking Visual Intelligence: SODA Diffusion Models Explained.mp42024-06-08 15:32:0141.00 MiB
How Vision Transformers Conquer Small Datasets!.mp42024-06-08 15:51:2730.01 MiB
LLaVA-Plus: Revolutionizing Multimodal Assistants with Tool Learning.mp42024-06-08 15:26:0644.57 MiB
Unleashing Neural ODEs: The Future of Deep Learning Explained!.mp42024-06-08 15:57:0735.72 MiB
Unveiling E(n) Equivariant Graph Neural Networks!.mp42024-06-08 15:56:4440.90 MiB
Extending Context Windows in LLMs with Position Interpolation .mp42024-06-08 16:00:1729.95 MiB
Unlocking the Power of Simple Siamese Networks.mp42024-06-08 15:48:5538.75 MiB
OOTDiffusion: Revolutionary Virtual Try-On Using Latent Diffusion Models.mp42024-06-08 15:32:1347.05 MiB
HyperDreamBooth: Breakthrough in Fast Face Personalization for AI Art.mp42024-06-08 15:28:1143.79 MiB
Turbocharge Your Language Models with Trillions of Tokens! Meet Retro 🚀.mp42024-06-08 15:22:3940.17 MiB
Unlocking Unified Visual Understanding: Video-LLaVA Explained!.mp42024-06-08 16:01:3541.89 MiB
PixArt-$\alpha$: Revolutionizing Text-to-Image Synthesis with Low Training Costs!.mp42024-06-08 16:06:4439.46 MiB
Speeding Up Language Models: Fast Inference with Mixture-of-Experts.mp42024-06-08 15:47:0137.43 MiB
Uni-SMART: Revolutionizing Multimodal Scientific Research!.mp42024-06-08 15:46:4634.55 MiB
Llama 2: Redefining Large Language Models with Safety and Open Foundation.mp42024-06-08 15:51:3250.66 MiB
FlashAttention: Revolutionizing Transformer Efficiency!.mp42024-06-08 15:35:2440.18 MiB
Ferret Multimodal Model: Refer and Ground Anything Anywhere!.mp42024-06-08 15:51:4332.88 MiB
RealmDreamer: Revolutionizing 3D Scenes from Text with Advanced Inpainting & Depth Diffusion.mp42024-06-08 15:43:3951.95 MiB
Zamba: The Next Big Thing in Efficient Language Models.mp42024-06-08 16:06:5034.25 MiB
Smoother and Safer Robot Training with gSDE in Reinforcement Learning!.mp42024-06-08 15:49:0230.68 MiB
Unlocking the Power of Cleaner Data: Enhancing Language Models Through Deduplication.mp42024-06-08 16:03:0339.86 MiB
How ControlNet++ Revolutionizes Image Consistency in AI Generation.mp42024-06-08 16:00:3234.13 MiB
DeepSpeed-VisualChat: Revolutionizing Multi-Image, Multi-Round AI Conversations.mp42024-06-08 15:45:5630.49 MiB
Train Big, Compress Smart: New Secrets to Speedy AI.mp42024-06-08 15:51:4932.50 MiB
Motion Mamba: The Future of Efficient Human Motion Generation.mp42024-06-08 15:24:1045.14 MiB
Perceiver AR: Revolutionizing Long-Context Modeling.mp42024-06-08 15:46:5441.27 MiB
Revolutionizing 3D Point Clouds: Meet the Point Transformer!.mp42024-06-08 15:59:5338.77 MiB
Scaling AI: Inside Google's 540B PaLM Model.mp42024-06-08 15:41:4840.17 MiB
How Language Models Double as Top-Tier Compressors!.mp42024-06-08 16:04:0739.30 MiB
How We Trained a 101B-Parameter LLM on a $100K Budget! .mp42024-06-08 15:49:5739.52 MiB
WebArena: Elevating Autonomous Agents in Realistic Web Scenarios.mp42024-06-08 16:01:1342.98 MiB
Revolutionary Breakthrough in Machine Translation with ALMA!.mp42024-06-08 15:46:3733.62 MiB
Unveiling Lion: The Breakthrough Optimizer Unlocked by AI!.mp42024-06-08 15:45:3142.53 MiB
FlashDecoding++: Revolutionizing GPU Inference Speeds for Large Language Models.mp42024-06-08 15:47:3848.93 MiB
Inheritune: Training Small Language Models with Minimal Data and Compute.mp42024-06-08 15:28:4430.44 MiB
Mastering Video Motions: Deep Dive into VMC with Temporal Attention Adaptation!.mp42024-06-08 16:05:0042.87 MiB
BERTScore: Revolutionizing Text Evaluation with Contextual Embeddings.mp42024-06-08 16:06:1540.10 MiB
Ferret-v2: Next-Level Referring and Grounding with Enhanced LLMs!.mp42024-06-08 15:53:0547.25 MiB
Revolutionary Recommendation System: Meet SPAR with Long Engagement Attention! .mp42024-06-08 15:58:2242.92 MiB
Advancing Theorem Proving with AI and Synthetic Data.mp42024-06-08 15:26:3330.39 MiB
Personalizing Text-to-Image Models with DreamBooth!.mp42024-06-08 15:43:0151.09 MiB
Why Large Language Models Fail with Long Texts: A Deep Dive.mp42024-06-08 15:44:5036.80 MiB
Revolutionizing Video Generation: Introducing VideoLCM!.mp42024-06-08 15:45:3943.53 MiB
One TTS Alignment Framework: Revolutionizing Text-to-Speech Accuracy.mp42024-06-08 16:01:5041.02 MiB
GPQA: The Ultimate Grad-Level Challenge for AI & Humans!.mp42024-06-08 16:06:0047.60 MiB
Enhancing AI: Mastering Helpfulness & Harmlessness .mp42024-06-08 15:35:4545.96 MiB
Cutting-Edge Hybrid Zoom for Smartphones: Explained!.mp42024-06-08 15:54:5830.83 MiB
Revolutionizing Neural Networks with Periodic Activation Functions.mp42024-06-08 15:30:5043.55 MiB
DreamReward: Revolutionizing Text-to-3D Generation with Human Preferences!.mp42024-06-08 15:49:1948.11 MiB
Unveiling HallusionBench: Tackling Visual Illusions in AI Models!.mp42024-06-08 15:30:5737.82 MiB
Inherent Fairness in AI: Optimizing Face Recognition Models.mp42024-06-08 15:28:5637.67 MiB
Meet aMUSEd: A Lightweight Revolution in Text-to-Image Generation.mp42024-06-08 15:46:0844.88 MiB
BLEURT: The New Gold Standard for Text Generation Metrics!.mp42024-06-08 16:02:5235.99 MiB
Unlocking Multilingual Power in CLIP with AltCLIP.mp42024-06-08 15:37:0826.04 MiB
Scaling Transformers: The DeepNet Breakthrough!.mp42024-06-08 15:40:3228.71 MiB
The Ultimate Chinese Benchmark: Unpacking the CMMMU.mp42024-06-08 15:48:0146.79 MiB
Revolutionizing Image Caption Evaluation with CLIPScore!.mp42024-06-08 15:29:1639.94 MiB
Is AI Really Thinking? Exploring Faithfulness in Chain-of-Thought Reasoning.mp42024-06-08 15:32:3736.62 MiB
The Future of Vision: Neighborhood Attention Transformer Explained!.mp42024-06-08 15:27:1243.43 MiB
Maximizing Efficiency with Compute-Optimal Language Models.mp42024-06-08 15:40:0138.91 MiB
How ChatGPT's Skills Are Evolving: Surprising Decreases and Increases!.mp42024-06-08 15:52:4344.86 MiB
How Skeleton-of-Thought Makes AI Faster Without Sacrificing Quality.mp42024-06-08 16:05:2231.70 MiB
FreeInit: Revolutionizing Video Diffusion with Enhanced Initialization.mp42024-06-08 15:44:2036.92 MiB
Mistral 7B: Redefining Efficiency in NLP Models.mp42024-06-08 16:04:1142.88 MiB
Efficient Image & Video Generation with Recurrent Interface Networks (RINs).mp42024-06-08 15:49:3340.72 MiB
Inside the Mind of RMDT: Revolutionizing Reinforcement Learning.mp42024-06-08 15:30:0834.47 MiB
UFOGen: Revolutionizing Text-to-Image Generation with One-Step Diffusion GANs.mp42024-06-08 15:47:4933.44 MiB
Can GPT-4 Effectively Explore? Insightful Findings from AI Research.mp42024-06-08 15:40:5240.56 MiB
“AI vs. Doctors: How Adapted Large Language Models Excel in Clinical Text Summarization”.mp42024-06-08 15:23:4440.91 MiB
SoundStorm: Revolutionizing Audio Generation with Speed and Quality!.mp42024-06-08 15:52:2836.10 MiB
How FastV is Revolutionizing Large Vision-Language Models!.mp42024-06-08 16:06:3238.18 MiB
Smooth Text-to-Video Magic: Discover Dual-Stream Diffusion Net!.mp42024-06-08 16:02:3038.59 MiB
DiffusionGPT: Revolutionizing Text-to-Image with LLMs and Expert Models.mp42024-06-08 15:25:4349.17 MiB
OLMo: A Leap Forward in Transparent Language Models.mp42024-06-08 15:43:5139.07 MiB
Octopus v2: Revolutionizing On-Device AI for Super Agents!.mp42024-06-08 15:32:3348.98 MiB
High-Resolution Image Synthesis with Rectified Flow Transformers.mp42024-06-08 15:38:0940.26 MiB
Taming Transformers for Stunning High-Resolution Images.mp42024-06-08 15:31:5344.13 MiB
Right for the Wrong Reasons: Syntactic Heuristics in AI Models Explained.mp42024-06-08 15:59:1142.84 MiB
Vidu4D: Mastering High-Fidelity 4D Reconstructions from Single Videos .mp42024-06-08 15:25:5444.99 MiB
MobileVLM: Revolutionizing Mobile Vision Language Models.mp42024-06-08 15:30:4644.63 MiB
Self-Discover: LLMs Unleashing New Reasoning Powers! .mp42024-06-08 15:48:3840.59 MiB
ELLA: Revolutionizing Text-to-Image Generation with Large Language Models.mp42024-06-08 16:07:0142.67 MiB
Voyager: Mastering Minecraft with a Lifelong Learning Agent.mp42024-06-08 15:41:3847.74 MiB
Revolutionizing Diffusion Models: Human Feedback without Reward Models.mp42024-06-08 15:41:0339.15 MiB
DINOv2: Mastering Visual Features Without Labels.mp42024-06-08 15:52:2544.07 MiB
Swin Transformer: Revolutionizing Vision with Shifted Windows.mp42024-06-08 16:04:1541.02 MiB
Diffusion-Based Planning: The Future of Flexible Behavior Synthesis!.mp42024-06-08 15:55:3844.54 MiB
LLaSM: A New Era in Multimodal AI for Speech and Language.mp42024-06-08 16:03:4936.29 MiB
Aligning Language Models to Regulation-Specific Needs | Arxflix.mp42024-06-08 15:58:0042.12 MiB
ToolLLM: Revolutionizing AI with Real-World API Mastery.mp42024-06-08 15:30:1937.49 MiB
Unlocking Neural Network Efficiency: Thermodynamic Natural Gradient Descent Explained.mp42024-06-08 15:58:3145.60 MiB
ShortGPT: Redefining Efficiency in Large Language Models!.mp42024-06-08 15:39:1433.86 MiB
Unlocking Text-to-Image Personalization: Meet Perfusion!.mp42024-06-08 15:42:0342.68 MiB
Striped Attention: Revolutionizing Causal Transformers!.mp42024-06-08 15:52:5241.60 MiB
Unified-IO 2: A New Frontier in Multimodal AI .mp42024-06-08 15:29:4242.11 MiB
Supercharging AI: How LayerSkip Enhances Language Model Speed and Efficiency.mp42024-06-08 15:24:3541.07 MiB
Unleashing Speed: Consistency Models for Fast Generative AI.mp42024-06-08 16:03:3339.80 MiB
PaLI-3: Unveiling the Power of Compact and Efficient Vision Language Models .mp42024-06-08 16:01:5840.71 MiB
Unlocking the Future of AI: Branch-Train-MiX (BTX) Explained.mp42024-06-08 15:56:0550.18 MiB
Empowering AI: Scaling Instructable Agents in 3D Worlds!.mp42024-06-08 15:56:3361.83 MiB
Unifying Video and Language Understanding with RingAttention.mp42024-06-08 15:34:4436.11 MiB
Revolutionizing Large Language Models: OneBit's 1-Bit Quantization Breakthrough.mp42024-06-08 16:00:0438.33 MiB
RoboVQA: Revolutionizing Robotics with Multimodal Long-Horizon Reasoning! .mp42024-06-08 15:50:0834.22 MiB
How Current Language Models Struggle with Long Contexts: Key Insights.mp42024-06-08 15:57:4847.92 MiB
Revolutionizing AI Art: How IP-Adapter Enhances Text-to-Image Models!.mp42024-06-08 15:28:5247.39 MiB
Exploring the Dark Side: Adversarial Attacks on Aligned Language Models.mp42024-06-08 15:26:5736.95 MiB
Revolutionizing Biomedical NLP: Domain-Specific Pretraining.mp42024-06-08 15:46:4335.64 MiB
HaloNets: The Future of Efficient Visual Backbones.mp42024-06-08 15:51:0551.96 MiB
Unleashing Style: Text-to-Image Generation with StyleDrop.mp42024-06-08 15:40:5943.70 MiB
Simplifying Object Detection with Plain Vision Transformers!.mp42024-06-08 15:42:1743.98 MiB
Mastering Video Generation: Meet MotionCtrl!.mp42024-06-08 15:30:5341.02 MiB
Unveiling LMSYS-Chat-1M: A Million Real-World LLM Conversations Explored .mp42024-06-08 15:45:0840.23 MiB
rl_reach: Simplifying Robotic RL Experiments.mp42024-06-08 15:27:3941.52 MiB
Early Dropout: Boosting Model Performance by Reducing Underfitting.mp42024-06-08 15:24:4745.92 MiB
Enhancing AI Safety with Safe RLHF: Balancing Helpfulness and Harmlessness.mp42024-06-08 15:35:0049.52 MiB
Reconstructing Cartoons in 3D: Toon3D Explained.mp42024-06-08 15:33:1449.25 MiB
Revolutionizing NLP: Meet GRIT - The Unified Model for Text Generation and Embedding.mp42024-06-08 16:01:4335.46 MiB
DeepSeekMath: Revolutionizing Mathematical Reasoning in Open-Source AI.mp42024-06-08 16:02:4250.82 MiB
High-Resolution Image Generation with Residual Quantization: Unlocking New Possibilities!.mp42024-06-08 15:50:4645.42 MiB
Decoding Scaling Laws in Neural Language Models: The Path to Efficiency!.mp42024-06-08 15:37:5847.47 MiB
Revolutionizing Language Models: Mixtral's Sparse Mixture of Experts Unveiled .mp42024-06-08 15:23:2637.72 MiB
MusicAgent: Revolutionizing Music Creation with AI & Large Language Models.mp42024-06-08 15:40:3045.28 MiB
Unifying Multimodal Learning: The Meta-Transformer Revolution .mp42024-06-08 15:38:4750.08 MiB
Unleashing Infinite Power: Scaling $n$-gram Models to Trillions of Tokens.mp42024-06-08 15:25:1340.73 MiB
Behavior Alignment via Reward Function Optimization: A Deep Dive.mp42024-06-08 15:43:0839.30 MiB
How Hackers Could Poison ChatGPT (and What to Do About It).mp42024-06-08 15:55:4537.53 MiB
TextSquare: Elevating Open-Source Models with Square-10M Dataset!.mp42024-06-08 15:23:1028.94 MiB
Revolutionizing Healthcare with MedAlign: How Clinician-Generated Data is Shaping LLM Performance.mp42024-06-08 15:35:0742.25 MiB
How Saliency-Guided Q-Networks Revolutionize Visual Reinforcement Learning.mp42024-06-08 15:22:1437.78 MiB
The Future of AI: Exploring Perceiver IO's General Architecture.mp42024-06-08 15:29:1241.68 MiB
Editing Factual Associations in GPT with ROME.mp42024-06-08 16:02:2033.50 MiB
GAIA: Benchmarking the True Capabilities of AI Assistants .mp42024-06-08 15:29:3138.48 MiB
Unleashing Phi-3-mini: Powerful AI on Your Phone .mp42024-06-08 15:48:0538.85 MiB
Simple Diffusion: Revolutionary High-Resolution Image Generation.mp42024-06-08 15:34:5255.25 MiB
How NaViT Revolutionizes Vision Transformers: Beyond Fixed Resolutions.mp42024-06-08 15:27:4746.27 MiB
StructLM: Revolutionizing AI with Generalist Models for Structured Knowledge.mp42024-06-08 15:37:2132.98 MiB
Discovering MagicTime: Transforming Text into Realistic Metamorphic Time-lapse Videos.mp42024-06-08 16:03:4246.15 MiB
Master Image Editing with DragGAN: Precise Interactive Manipulation.mp42024-06-08 15:36:3949.59 MiB
Performers: Efficient Transformers Explained.mp42024-06-08 15:59:5636.29 MiB
Speedy 3D Creation with LN3Diff: Game-Changing Latent Neural Fields.mp42024-06-08 15:35:5239.43 MiB
Unlocking Vision Models: Scalable Autoregressive Image Training Unveiled! .mp42024-06-08 15:38:0534.98 MiB
Unleashing LLM Power: How Scaling and Finetuning Transform Performance.mp42024-06-08 15:30:0138.27 MiB
Next-Gen Captions: Unveiling Visual Fact Checker.mp42024-06-08 16:07:1240.10 MiB
ESB: The Future of Multi-Domain Speech Recognition!.mp42024-06-08 15:43:3540.65 MiB
Create Realistic 3D Avatars from Text: Make-A-Character Explained.mp42024-06-08 15:22:3265.03 MiB
Revolutionizing 3D Asset Creation: ComboVerse Explored!.mp42024-06-08 15:40:1344.52 MiB
Revolutionizing Video Generation: Mora's Multi-Agent Framework Explained.mp42024-06-08 15:31:1139.64 MiB
Unlocking Faster AI: Medusa's Multi-Head Decoding for LLMs.mp42024-06-08 15:36:1350.47 MiB
Mastering Text-to-Image Diffusion: The RPG Framework Unveiled!.mp42024-06-08 15:49:4142.45 MiB
Unlocking Transformers: The Secret Connection to RNNs Revealed!.mp42024-06-08 16:05:1941.82 MiB
Whisper: Revolutionizing Speech Recognition with Weak Supervision.mp42024-06-08 16:02:0937.06 MiB
Unlocking Zero-Shot Multimodal Reasoning with Socratic Models.mp42024-06-08 15:22:2246.13 MiB
Unlocking Visual Understanding: TokenLearner Explained.mp42024-06-08 15:28:4139.11 MiB
Revolutionizing Image-to-Video Generation with ConsistI2V!.mp42024-06-08 15:53:0052.81 MiB
Revolutionizing Image Synthesis with Hourglass Diffusion Transformers (HDiT).mp42024-06-08 15:57:0150.13 MiB
Mastering Text-to-Image Diffusion with Orthogonal Finetuning (OFT).mp42024-06-08 15:42:4043.11 MiB
Kandinsky: Revolutionizing Text-to-Image Synthesis with Prior Models & Latent Diffusion.mp42024-06-08 15:54:4233.88 MiB
Mega-TTS 2: Revolutionizing Zero-Shot Text-to-Speech with Longer Prompts!.mp42024-06-08 15:44:1239.63 MiB
Revolutionizing Windows: Meet UFO - The Ultimate UI Agent!.mp42024-06-08 15:38:2052.75 MiB
Scaling Vision Models: Inside Swin Transformer V2.mp42024-06-08 15:58:0336.02 MiB
Revolutionizing AI: Direct Language Model Alignment with Online Feedback.mp42024-06-08 15:57:5235.63 MiB
Unleashing the Phased Consistency Model - Efficient Image Generation Explained!.mp42024-06-08 15:37:1436.25 MiB
Unlocking REALM: The Next Evolution in Language Models.mp42024-06-08 15:51:5833.72 MiB
Transforming Vision with Conditional Positional Encodings in Vision Transformers.mp42024-06-08 15:37:4037.51 MiB
The Secret Sauce Behind Self-Supervised Learning Without Pairs.mp42024-06-08 15:56:2339.89 MiB
Smaller Vision Models with Big Impact: Discover the S2 Scaling Revolution!.mp42024-06-08 15:54:0545.81 MiB
Exploring Hierarchical Text-Conditional Image Generation with CLIP Latents.mp42024-06-08 15:51:0840.60 MiB
Tree of Thoughts: Revolutionizing AI Problem Solving.mp42024-06-08 15:52:4849.49 MiB
Mastering AI: The Schedule-Free Learning Revolution.mp42024-06-08 15:22:5539.99 MiB
Learning Sounds Like Humans: Minimal Supervision Framework Explained.mp42024-06-08 15:29:4537.09 MiB
MobiLlama: Revolutionizing Efficient AI for Edge Devices.mp42024-06-08 15:41:2339.74 MiB
A Deep Dive into TQC: Tackling Overestimation Bias in RL.mp42024-06-08 15:36:3039.66 MiB
Unveiling PIPPA: The Ultimate Conversational AI Dataset for Role-Play.mp42024-06-08 15:54:2942.60 MiB
The Platonic Representation Hypothesis: How AI Models Converge Towards a Unified Reality.mp42024-06-08 15:58:1146.60 MiB
Florence-2: The Future of Unified Vision Tasks!.mp42024-06-08 15:35:3765.68 MiB
MagiCapture: Revolutionizing High-Resolution Portrait Customization!.mp42024-06-08 15:56:2049.73 MiB
Breakthrough in Document OCR: Meet Nougat - The Neural Transformer for Scientific PDFs!.mp42024-06-08 15:43:4844.76 MiB
Emu: The Secret to Generating Stunning Images with Small Data Sets.mp42024-06-08 15:25:2541.71 MiB
Latent Diffusion Models: Revolutionizing High-Resolution Image Synthesis.mp42024-06-08 15:34:4143.19 MiB
Revolutionizing Video Editing: Meta AI's EVE Explained! .mp42024-06-08 16:07:2942.64 MiB
Scaling Vision Transformers: Revealing the Power of Large Models.mp42024-06-08 15:35:0440.11 MiB
GLaMM: Revolutionizing Pixel-Level Grounding in Multimodal Models.mp42024-06-08 15:30:2542.66 MiB
FlexiViT: Transforming Vision Transformers with Adaptive Patch Sizes.mp42024-06-08 15:35:1658.53 MiB
Bringing Portraits to Life: EMO's Audio2Video Diffusion Model.mp42024-06-08 16:03:2961.38 MiB
Unlocking Precision: ControlNet in Text-to-Image Models.mp42024-06-08 15:31:2750.80 MiB
Kandinsky 3.0: The Future of Text-to-Image AI.mp42024-06-08 15:59:0740.76 MiB
Adaptive Sparsity in Transformers Explained!.mp42024-06-08 16:00:0736.92 MiB
How Scaling Instruction-Finetuning Improves Language Models.mp42024-06-08 15:30:3340.35 MiB
Why Weaver Outshines GPT-4 in Creative Writing!.mp42024-06-08 15:48:3034.91 MiB
Ultra High-Fidelity 3D Avatars with Dynamic Gaussians.mp42024-06-08 15:28:3746.92 MiB
Unveiling LLaVA: The Next-Gen Visual Language Assistant.mp42024-06-08 15:23:1438.90 MiB
Unifying Vision and Language: Inside X-Decoder's Breakthrough.mp42024-06-08 15:49:1042.69 MiB
LoraHub: Transforming Task Generalization with Dynamic LoRA Composition.mp42024-06-08 15:36:4345.00 MiB
CapsFusion: Boosting AI with Better Image-Text Data at Scale!.mp42024-06-08 15:24:5840.33 MiB
Ring Attention: Revolutionizing Transformer Memory for Endless Sequences.mp42024-06-08 16:06:4735.96 MiB
Discover Objects in Images with Self-Supervised Transformers: No Labels Needed!.mp42024-06-08 15:21:4938.98 MiB
S-LoRA: Efficiently Serving Thousands of LoRA-Adaptive Models!.mp42024-06-08 15:38:3540.82 MiB
How Prompt Cache is Revolutionizing AI: Faster and Smarter Inference .mp42024-06-08 15:23:2355.84 MiB
Mega: Transforming Long-Sequence Modeling with Gated Attention.mp42024-06-08 15:32:5434.63 MiB
Next-Level Image and Video Generation: Matryoshka Diffusion Models!.mp42024-06-08 15:25:4741.05 MiB
Unified Representation: Language, Images & 3D Point Clouds Explained!.mp42024-06-08 15:23:3148.40 MiB
Panda-70M: Revolutionizing Video Captioning with 70M Clips!.mp42024-06-08 15:34:2456.47 MiB
The Future of Materials Modeling: Introducing FAENet!.mp42024-06-08 16:08:0037.91 MiB
OmniACT: Revolutionizing Multimodal AI for Desktop & Web Tasks! .mp42024-06-08 15:35:4939.05 MiB
Vidu: Next-Level Text-to-Video Generation with Diffusion Models .mp42024-06-08 15:45:3652.52 MiB
Speeding Up Transformers: The Power of SwitchHead's MoE Attention!.mp42024-06-08 15:51:0034.54 MiB
Sora Unveiled: Transforming Text into Dynamic Videos.mp42024-06-08 16:04:4149.47 MiB
Animate Anyone: Revolutionary Image-to-Video Synthesis.mp42024-06-08 15:46:0046.13 MiB
SDXL: A New Benchmark in High-Resolution Image Synthesis .mp42024-06-08 16:00:1140.21 MiB
LucidDreamer: Revolutionizing Domain-Free 3D Scene Generation .mp42024-06-08 15:22:5732.38 MiB
Dynamic Typography: How Text Comes Alive with AI Animation!.mp42024-06-08 15:39:0455.68 MiB
The Future of Large Language Models: From Training to Deployment🚀.mp42024-06-08 15:45:4338.62 MiB
MoE-Mamba: Revolutionizing Language Models with Efficiency and Scalability.mp42024-06-08 15:22:5134.74 MiB
Unifying Visual and Language Models: Meet CoCa!.mp42024-06-08 15:26:3746.97 MiB
ReVideo: Revolutionizing Video Editing with Motion and Content Control.mp42024-06-08 15:54:0837.43 MiB
Tracking 2D Pixels in 3D Space: The Future of Motion Estimation.mp42024-06-08 15:24:2338.04 MiB
Reinventing Image Quantization: A Deep Dive into TE-VQGAN.mp42024-06-08 15:44:2548.62 MiB
Revolutionizing Robots: The Universal Manipulation Interface!.mp42024-06-08 15:28:3357.45 MiB
EfficientViT: Making Vision Transformers Faster for Real-Time Applications!.mp42024-06-08 15:31:0841.39 MiB
Transforming 3D with TripoSR: Fast Object Reconstruction in 0.5 Seconds!.mp42024-06-08 15:36:0943.18 MiB
BlackMamba: Revolutionizing Language Models with Mixture of Experts & State-Space Models.mp42024-06-08 15:45:1836.03 MiB
SpeechT5: Revolutionizing Spoken Language Processing.mp42024-06-08 15:58:1840.29 MiB
Revolutionizing Atomic Calculations with Spherical Channels 🌐.mp42024-06-08 16:02:1339.64 MiB
TinyGPT-V: Maximizing Efficiency in Multimodal Language Models.mp42024-06-08 15:26:4541.91 MiB
Unlocking Infinite Context: Meet Infini-attention for Transformers!.mp42024-06-08 16:06:1833.06 MiB
Creating Full-length Music with AI: Dive into Latent Diffusion Models.mp42024-06-08 15:50:1933.12 MiB
Unveiling MagicVideo-V2: Stunning High-Aesthetic Video Generation from Text Descriptions.mp42024-06-08 15:51:1236.62 MiB
How LLaMA Enhances Video Question Answering with Temporal and Causal Reasoning.mp42024-06-08 16:08:0729.74 MiB
Palo: Breaking Language Barriers with Multimodal AI for 5 Billion People.mp42024-06-08 15:24:1733.57 MiB
Unlocking Efficiency in Transformers: The Mixture-of-Depths Approach.mp42024-06-08 15:31:1544.79 MiB
Unlocking LLM Power on Consumer GPUs: Meet PowerInfer!.mp42024-06-08 15:21:4545.68 MiB
InstructPix2Pix: Revolutionizing Image Editing with AI Instructions.mp42024-06-08 16:05:3741.63 MiB
Code Meets Math: Unlocking the Genius of Open-Source LLMs with MathCoder.mp42024-06-08 16:02:3836.72 MiB
How In-Context Learning Creates Task Vectors: Explained!.mp42024-06-08 15:42:2242.71 MiB
Revolutionizing Video Generation: Unveiling VLOGGER for Realistic Avatars.mp42024-06-08 15:21:5751.32 MiB
Democratizing Autonomous Driving with DriverGym!.mp42024-06-08 15:37:2840.58 MiB
Simplifying VQ-VAEs: The Power of Finite Scalar Quantization.mp42024-06-08 15:28:2826.01 MiB
Hyper-SD: Revolutionizing Image Synthesis with Trajectory Segmented Consistency.mp42024-06-08 16:06:5436.73 MiB
MoEUT: Revolutionizing Universal Transformers with Mixture-of-Experts.mp42024-06-08 15:57:2341.15 MiB
Unlocking the Power of Multi-modality: Mini-Gemini Explained.mp42024-06-08 15:23:5544.94 MiB
Unlocking Knowledge: The Power of Retrieval-Augmented Generation.mp42024-06-08 15:46:5736.78 MiB
MegaScale: Unleashing LLM Training on 10,000+ GPUs! .mp42024-06-08 15:56:5248.03 MiB
Unlocking Vision Power: Bottleneck Transformers Explained!.mp42024-06-08 15:48:1340.88 MiB
LCM-LoRA: Boosting Text-to-Image Generation Efficiency!.mp42024-06-08 15:49:2538.65 MiB
How Recurrent Memory Revolutionizes Long Document Processing.mp42024-06-08 15:59:1837.05 MiB
Revolutionizing AI Training: ReSTEM Uses Model-Generated Data to Outshine Human Inputs.mp42024-06-08 16:01:2848.67 MiB
WebVoyager: Revolutionizing Web Navigation with AI-Powered Multimodal Models.mp42024-06-08 15:52:1040.01 MiB
CroCo: Revolutionizing 3D Vision with Cross-View Completion.mp42024-06-08 15:24:0241.99 MiB
Unlocking the Full Potential of Language Models: DAPT and TAPT Explained!.mp42024-06-08 16:02:0645.71 MiB
Decoding Time Series: Unveiling the Magic of TimeX.mp42024-06-08 15:26:3044.20 MiB
AdaMod: The Next-Gen Algorithm for Deep Learning Stability.mp42024-06-08 15:56:3633.25 MiB
Unlocking BART: The Game-Changer for Language Models.mp42024-06-08 15:43:0439.47 MiB
SuperGlue: Revolutionizing Feature Matching with Graph Neural Networks.mp42024-06-08 15:29:0945.27 MiB
How MotionLLM is Revolutionizing Human Behavior Understanding!.mp42024-06-08 16:06:4041.85 MiB
Breaking Boundaries: InternLM-XComposer2-4KHD's Mastery of High-Resolution Vision-Language Tasks .mp42024-06-08 15:33:0950.21 MiB
Implicit Self-Improvement for AI: A Game Changer in Training Large Language Models.mp42024-06-08 15:35:2042.25 MiB