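Multimodal Understanding
Scenario
Multimodal understanding is a technical field that integrates data from multiple modalities, such as text, images, audio, and video, for comprehensive perception and analysis. Representative models include GPT-4o and Claude 3.5 Sonnet.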
614 mentions · 390 connections · Last seen: 2026-06-29
Relationship graph
Relations (444)
Applied to (295)
GRPO致境 T 系列LongCat-NextAgent边缘计算文档理解MoE阶跃星辰KV CacheQwen3.5-OmniQwen3GPT-5AI写作代码生成AI金融AIUIWan2.7-ImageAI编程助手Chain-of-ThoughtGLM-5视频生成具身智能人形机器人3D生成GLM-5V-TurboAI安全Seedance 2.0ColaOSAI芯片GLM-4人机交互广告推荐Gemma 4Function Calling豆包 Pro机器人TransformerEgoTouch自动驾驶SEMFGEN-1SFT可灵AIAI教育Muse Spark语音合成Claude Opus 4.6SpatialStackMoT-2BVimRAGAI科研助手知识蒸馏即梦Gemini 3.1 Pro数字人AI客服图像生成Gemini 2.5 ProOMNIMEMStarVLALatent SpaceGPT-4o绝对时间戳编码医疗AI视频理解LatentUMBeing-H0.7Vector Database科大讯飞混元3D世界模型2.0具身智能Chance AIChatGPTGemini腾讯混元 HY-World 2.0机器人控制PersonaVLM超级EvaTool UseOpenAIAgentGPT Image 2Kimi MoonshotAI搜索WALL-BQwen3.6-27BKVL 多模态架构GPT-5.5NotebookLMPRETAI芯片AWE3.0DOVE兔展智能MathForge表征收敛深度求索 DeepSeekTIPSv2机器人豆包大模型2.0TokenDuMateMTSSViTMIMMiniCPM-o 4.5SenseNova U1 LiteLangFlow司法矫正校园心理企业招聘OlmoEarthMixture of ExpertsLLaMA 4SenseNova U1DeepSeek V4Chain-of-ThoughtJanus扫地机器人Anthropic深度求索 DeepSeekViF视觉语言模型OCRMSRLDeepSeekGemini 2.0DeepSeek-V4-Flash视觉基元OpenWorldLibCL-Bench LifeChatGPT心理康复支持GPT-5.5 Instant强化学习Janus-Pro-7B隐式推理豆包大模型大模型APIOmni2SoundQwen-2.5-VLGoogle AI Edge Gallery智能穿戴OneTrackerV2DeepSeek V4.1V4.1MCPBARD-VL全双工实时交互TML-Interaction-Smallv-HUBLLaVA-UHD v4MiniCPM-V 4.6视觉 token 压缩ViT 架构重构Omni-FlowViT前置压缩RAGReasonBrainQwen3-VL-8B连续扩散模型Qwen2.5-VL-7BVision-R1SeePhys ProQwen3.7-Plus-PreviewRLSDCoPDHyperEyesMemEye方舟平台Qwen3.7Thinker-Talker 架构Cola DLMGammaGemini OmniVQAQwen3.5-LiveTranslate-FlashAI办公Qwen3.7-MaxRAEv2Gemini 3.5 FlashHeimaESI-BenchAI无障碍Visual Para-ThinkerAI OSHiDream系列大模型符号主义检索增强场景图DeepSeek Sparse AttentionMiMo-V2.5-Pro视觉推理TextPro-SLMStep 3.7 FlashAndesVLBigQuery ObjectRefsMetaSenseNova-U1-8B-MoT-InfographicM3端侧多模态大模型MiniMax M3MSA智能家居MiniMax-01Attention MechanismCosmos 3Gemma 4 12BGemini Omni FlashDreaming稀疏注意力数据标注Qwen3-VL-2BSiri AISiriLanceSiriClaude Fable 5小浣熊Gemini Pro 1.5长上下文Qwen3-VL-4BSenseNova-U1-8B-MoT-InterleavedKimi K2.7 Code悟界·Emu3.5GaussianDWMQwen3-VLDoubao Seed 1.6Emu3.5Sora-2星火X2-VLUniTouch自监督表征学习Representation ForcingLatent ReasoningClue-Guided QA 
GenerationQwen2.5-Omni-7BVITA-1.5-7BQwen3-Omni-30BLivisMiniMax-M3VL-JEPAA-TPT时间景观临界闪烁融合阈值Apache IcebergGPT 5.5DeepSeek V3GLM-5.2Claude Opus 4.7MMMUGemini 3.5 ProAudioX-TurboUnisonMindStellaris-VL-0.8BMOSSSAGGraphRAG豆包大模型2.1Seed 2.1 ProSeed2.1豆包2.1ProSeed Audio 1.0豆包 2.1 ProOpus 4.8DepthVLM清研精准小微NEO-ov统一自回归框架Information BottleneckSeed-2.1Seed 2.1自动驾驶低秩自适应微调世界模型TransPrune火山引擎 LAS豆包 Seed 2.1 ProSenseNova-U1 Pro5G-AOctopusU6GHzMoKusAI办公FlinkStream Memory持续微调VLX-FlowWan Streamer v0.1YodaOSVLAOMG-DiTMOSS-VL扩散模型
Uses technology (129)
SpatialPointDM0模型Qwen3-VL-2BPixVerse V6Qwen3.5-OmniDeepSeek V4Agent腾讯Qwen2.5GLM-5Seedance 2.0豆包大模型Qwen3Qwen-VLGemini 2.0Gemma 4Wan2.7-VideoLLMGen DAS Dex万相2.7通义千问 MaxRF-GPTAGIBOT WORLD 2026DeepSeek V3SekoTransformerGenie Sim 3.0Muse SparkClaude Mythos有道宝库OmniVTA豆包 Pro商汤如影GPT-4oInfiniClaw BoxCPMaster系统Genie SimRT-MeshKimi MoonshotGen DexMEgo系列CutClaw精灵G2系列Claude Opus 4.6GPT Image 2π0.7Claude DesignGO-3UMI-FTEgo4DOpenMAIC多维视界SenseNova-SIGenie Envisioner 2.0Qwen-Omni饕餮.skillAWE3.0可灵3.0DuMateTIPSv2Xiaomi-Robotics-0GenFlow 4.0VideoAuto-R1LIVRUEQManagerDeepSeek V4-ProMiniCPM-o 4.5JVS Claw洞见人和可灵AI机器人深度求索 DeepSeekDeepSeek-V4-FlashFamiliarBuzzyGENE-26.5豆包大模型1.6阿班AirPods Ultra慧思开物AgentREFORM深度学习Gemini IntelligenceGPT-5Gemini 2.5 ProGooglebook Magic PointerX-OmniClaw医院AI药师StoReelGemini Omni千问录音纪要Tabbit如祺数据平台文心大模型5.1EVE数栈V7.0SPECTREUltrasound-CLIPSemVideoUni-Hand《多模态大模型文本智能白皮书(2026)》千问APP百炼CLIUniMedVLAhaCreatorLLaVA-OneVision-2.0MiniMax M3海尔Seeker套系ChatGPT豆包 AppApple Foundation ModelsSiri AISiriNeuraverse平台Qwen3.6-Plus办公小浣熊桌面端 2.0SpaceMind开悟世界模型 Kairos讯飞40克AI翻译眼镜AI巡检智能体开发平台SAFEPATH灵心巧手亲密交互大模型Gemini清研精准TRAEdoubao-seed-2-1-pro-260628WALL-BClipto.AI
Competes with (1)
Released (1)
Leads (1)
Invested in (1)
Related articles (614)