Linux大棚 – 不忘初心的技术博客,浮躁时代的安静角落
  •  首页
  •  技术日记
  •  编程
  •  旅游
  •  数码
  •  登录
  1. 标签
  2. Multimodal
  • A Survey of Multimodal Large Language Model from A Data-centric Perspective

    本文是LLM系列文章,针对《A Survey of Multimodal Large Language Model from A Data-centric Perspective》的翻译。以数据为中心的多模态大型语言模型综述 摘要 1
    Large Language Survey Multimodal centric
    admin 3月前
    45 0
  • 【论文阅读】CentralNet: a Multilayer Approach for Multimodal Fusion

    CentralNet相比于Concatenate的创新点 Concate的方法相当于在各自模态的特征分别独立抽取之后做融合,但是不干预特征抽取的过程。这显然会漏掉一些不同模态之间的相关性的信息,
    论文 CentralNet Multilayer fusion Multimodal
    admin 4月前
    50 0
  • 【文献阅读】A Comprehensive Review of Multimodal Large Language Models

    一、回顾MLLMs 在语言、图像、视频和音频处理等多模态任务中表现出色。这些模型通过整合多模态信息来增强多模态任务的有效性。在自然语言处理(NLP)任务中,如文本生成和机
    文献 review Comprehensive Multimodal Language
    admin 6月前
    128 0
  • Comprehensive Multimodal Segmentation in Medical Imaging

    作者未提供代码
    Multimodal Comprehensive Segmentation Imaging Medical
    admin 6月前
    81 0
  • Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Surve

    Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends
    Multimodal Large Abilities EXPLORING Reasoning
    admin 6月前
    99 0
  • A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences

    本文是LLM系列文章,针对《Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences》的翻译
    Multimodal Large Comprehensive Benchmark Language
    admin 6月前
    96 0
  • AGI之MFM:《Multimodal Foundation Models: From Specialists to General-Purpose Assistants多模态基础模型:从专家到通用助

    AGI之MFM:《Multimodal Foundation Models: From Specialists to General-Purpose Assistants多模态基础模型:从专家到通
    模型 多模 基础 专家 Multimodal
    admin 7月前
    97 0
CopyRight © 2022 All Rights Reserved 豫ICP备2021025688号-21
Processed: 0.020 , SQL: 9