首页
技术日记
编程
旅游
数码
登录
标签
Multimodal
A Survey of Multimodal Large Language Model from A Data-centric Perspective
本文是LLM系列文章,针对《A Survey of Multimodal Large Language Model from A Data-centric Perspective》的翻译。以数据为中心的多模态大型语言模型综述 摘要 1
Large
Language
Survey
Multimodal
centric
admin
3月前
45
0
【论文阅读】CentralNet: a Multilayer Approach for Multimodal Fusion
CentralNet相比于Concatenate的创新点 Concate的方法相当于在各自模态的特征分别独立抽取之后做融合,但是不干预特征抽取的过程。这显然会漏掉一些不同模态之间的相关性的信息,
论文
CentralNet
Multilayer
fusion
Multimodal
admin
4月前
50
0
【文献阅读】A Comprehensive Review of Multimodal Large Language Models
一、回顾MLLMs 在语言、图像、视频和音频处理等多模态任务中表现出色。这些模型通过整合多模态信息来增强多模态任务的有效性。在自然语言处理(NLP)任务中,如文本生成和机
文献
review
Comprehensive
Multimodal
Language
admin
6月前
128
0
Comprehensive Multimodal Segmentation in Medical Imaging
作者未提供代码
Multimodal
Comprehensive
Segmentation
Imaging
Medical
admin
6月前
81
0
Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Surve
Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends
Multimodal
Large
Abilities
EXPLORING
Reasoning
admin
6月前
99
0
A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences
本文是LLM系列文章,针对《Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences》的翻译
Multimodal
Large
Comprehensive
Benchmark
Language
admin
6月前
96
0
AGI之MFM:《Multimodal Foundation Models: From Specialists to General-Purpose Assistants多模态基础模型:从专家到通用助
AGI之MFM:《Multimodal Foundation Models: From Specialists to General-Purpose Assistants多模态基础模型:从专家到通
模型
多模
基础
专家
Multimodal
admin
7月前
97
0