2025计算机视觉论文综述汇总-Linux大棚

admin 管理员组

文章数量: 1184232

2025.5.23 分类由AI生成

遥感图像处理

遥感基础模型：
A Survey on Remote Sensing Foundation Models-- From Vision to Multimodality --2503.22081v1.pdf
MIMRS-- A Survey on Masked Image Modeling in Remote Sensing – 2504.03181v2.pdf
Vision Mamba in Remote Sensing-- A Comprehensive Survey of Techniques, Applications and Outlook – 2505.00630v2.pdf
遥感图像变化检测：
A Survey of Sample-Efficient Deep Learning for Change Detection in Remote Sensing-- Tasks, Strategies, and Challenges – 2502.02835v1.pdf
SAR 舰船分类：
A Survey on SAR ship classification using Deep Learning – 2503.11906v1.pdf
卫星AI图像处理：
Advancing Earth Observation-- A Survey on AI-Powered Image Processing in Satellites – 2501.12030v1.pdf
精准农业：
Vision Transformers in Precision Agriculture-- A Comprehensive Survey – 2504.21706v2.pdf
A survey of datasets for computer vision in agriculture – 2502.16950v1.pdf

医学图像处理

医学图像重建：
A Comprehensive Survey on Magnetic Resonance Image Reconstruction – 2503.07097v1.pdf
A Survey of fMRI to Image Reconstruction – 2502.16861v1.pdf
A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli – 2503.15978v1.pdf
医学图像分割：
Recent Advances in Medical Imaging Segmentation-- A Survey – 2505.09274v1.pdf
Self-Supervised Learning for Image Segmentation: A Comprehensive Survey – 2505.13584v1.pdf 总结
医学图像到网格重建：
From Pixels to Polygons-- A Survey of Deep Learning Approaches for Medical Image-to-Mesh Reconstruction – 2505.03599v1.pdf
医学图像深度学习（通用）：
Deep Learning Approaches for Medical Imaging Under Varying Degrees of Label Availability-- A Comprehensive Survey – 2504.11588v1.pdf
病理学基础模型：
A Survey of Pathology Foundation Model-- Progress and Future Directions – 2504.04045v2.pdf
A Survey on Computational Pathology Foundation Models-- Datasets, Adaptation Strategies, and Evaluation Tasks – 2501.15724v2.pdf
计算神经影像学：
Diffusion Models for Computational Neuroimaging-- A Survey – 2502.06552v1.pdf
视网膜成像/眼科学：
The Eye as a Window to Systemic Health-- A Survey of Retinal Imaging from Classical Techniques to Oculomics – 2505.04006v1.pdf
医学图像中的 Transformers：
Transformers in Medical Imaging-- A Survey – 2201.09873v1.pdf

点云处理 / 3D 视觉

3D 配准：
3D Registration in 30 Years-- A Survey – 2412.13735v2.pdf
3D 场景生成：
3D Scene Generation-- A Survey – 2505.05474v1.pdf
事件驱动3D重建：
A Survey on Event-driven 3D Reconstruction-- Development under Different Categories – 2503.19753v2.pdf
自动驾驶中的学习型3D重建：
Learning-based 3D Reconstruction in Autonomous Driving-- A Comprehensive Survey – 2503.14537v2.pdf
3D 点云基础模型：
Foundational Models for 3D Point Clouds-- A Survey and Outlook – 2501.18594v1.pdf
辐射场隐式和显式表示：
Editing Implicit and Explicit Representations of Radiance Fields-- A Survey – 2412.17628v1.pdf
神经辐射场 (NeRF)：
Neural Radiance Fields for the Real World-- A Survey – 2501.13104v1.pdf
点云场景分割：
Point Cloud Based Scene Segmentation-- A Survey – 2503.12595v1.pdf
点云语义信息隐式引导与显式表示：
Implicit Guidance and Explicit Representation of Semantic Information in Points Cloud-- A Survey – 2501.05473v1.pdf

视频处理

视频生成：
A Comprehensive Survey on Generative AI for Video-to-Music Generation – 2502.12489v1.pdf
A Survey of Interactive Generative Video – 2504.21853v1.pdf
Survey of Video Diffusion Models-- Foundations, Implementations, and Applications – 2504.16081v1.pdf
Exploring the Evolution of Physics Cognition in Video Generation-- A Survey – 2503.21765v1.pdf
视频到音乐生成：
Vision-to-Music Generation-- A Survey – 2503.21254v1.pdf
动作质量评估：
A Comprehensive Survey of Action Quality Assessment-- Method and Benchmark – 2412.11149v1.pdf
A Decade of Action Quality Assessment-- Largest Systematic Survey of Trends, Challenges, and Future Directions – 2502.02817v1.pdf
动作识别：
SMART-Vision-- Survey of Modern Action Recognition Techniques in Vision – 2501.13066v1.pdf
云边端协同系统中的视频分析：
A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems – 2502.06581v4.pdf
4D 生成：
Advances in 4D Generation-- A Survey – 2503.14501v2.pdf
3D 理解中的具身智能（视频相关）：
Embodied Intelligence for 3D Understanding-- A Survey on 3D Scene Question Answering – 2502.00342v1.pdf
SAM2 图像和视频分割：
SAM2 for Image and Video Segmentation-- A Comprehensive Survey – 2503.12781v1.pdf

普通图像处理与计算机视觉

图像增强：
A Comprehensive Survey on Image Signal Processing Approaches for Low-Illumination Image Enhancement – 2502.05995v1.pdf
Underwater Image Enhancement using Generative Adversarial Networks-- A Survey – 2501.06273v1.pdf
Survey on Single-Image Reflection Removal using Deep Learning Techniques – 2502.08836v1.pdf
图像质量评估：
A Survey on Image Quality Assessment-- Insights, Analysis, and Future Outlook – 2502.08540v1.pdf
图像分割：
SAM2 for Image and Video Segmentation: A Comprehensive Survey – 2503.12781v1.pdf
目标检测：
Small Object Detection-- A Comprehensive Survey on Challenges, Techniques and Real-World Applications – 2503.20516v1.pdf
边缘检测：
Hybrid Multi-Stage Learning Framework for Edge Detection-- A Survey – 2503.21827v1.pdf
图像反演：
Image Inversion-- A Survey from GANs to Diffusion and Beyond – 2502.11974v1.pdf
图像识别：
Image Recognition with Online Lightweight Vision Transformer-- A Survey – 2505.03113v2.pdf
云服务中面部图像隐私保护：
A Survey on Facial Image Privacy Preservation in Cloud-Based Services – 2501.08665v1.pdf
生成对抗网络 (GANs)：
Generative Adversarial Networks with Limited Data-- A Survey and Benchmarking – 2504.05456v1.pdf
工业异常合成：
A Survey on Industrial Anomalies Synthesis – 2502.16412v1.pdf
手写文本识别：
Handwritten Text Recognition-- A Survey – 2502.08417v1.pdf
单目度量深度估计：
Survey on Monocular Metric Depth Estimation – 2501.11841v3.pdf
基于事件的成像：
From Events to Enhancement-- A Survey on Event-Based Imaging Technologies – 2505.05488v1.pdf
A Survey on Event-based Optical Marker Systems – 2504.20736v1.pdf
Towards Mobile Sensing with Event Cameras on High-agility Resource-constrained Devices-- A Survey – 2503.22943v2.pdf
视觉数据数字孪生生成：
Digital Twin Generation from Visual Data-- A Survey – 2504.13159v1.pdf
时尚新品性能预测：
New Fashion Products Performance Forecasting-- A Survey on Evolutions, Models and Emerging Trends – 2501.10324v1.pdf

大型视觉-语言模型 (LVLMs) / 多模态 AI

LVLM 对齐与安全：
A Survey of Safety on Large Vision-Language Models-- Attacks, Defenses and Evaluations – 2502.14881v1.pdf
A Survey of State of the Art Large Vision Language Models-- Alignment, Benchmark, Evaluations and Challenges – 2501.02189v6.pdf
Large Vision-Language Model Alignment and Misalignment-- A Survey Through the Lens of Explainability – 2501.01346v2.pdf
Aligning Multimodal LLM with Human Preference-- A Survey – 2503.14504v2.pdf
When Data Manipulation Meets Attack Goals-- An In-depth Survey of Attacks for VLMs – 2502.06390v2.pdf
高效 LVLM：
A Survey on Efficient Vision-Language Models – 2504.09724v1.pdf
Small Vision-Language Models-- A Survey on Compact Architectures and Techniques – 2503.10665v1.pdf
Vision-Language Models for Edge Networks-- A Comprehensive Survey – 2502.07855v1.pdf
LVLM 集成与 LLM 视觉能力赋能：
Efficiently Integrate Large Language Models with Visual Perception-- A Survey from the Training Paradigm Perspective – 2502.01524v1.pdf
How Vision-Language Tasks Benefit from Large Pre-trained Models-- A Survey – 2412.08158v1.pdf
How to Enable LLM with 3D Capacity-- A Survey of Spatial Reasoning in LLM – 2504.05786v1.pdf
多模态思维链推理：
Multimodal Chain-of-Thought Reasoning-- A Comprehensive Survey – 2503.12605v2.pdf
多模态数据增强：
Multimodal Large Language Models for Image, Text, and Speech Data Augmentation-- A Survey – 2501.18648v2.pdf
多模态融合与机器人视觉：
Multimodal Fusion and Vision-Language Models-- A Survey for Robot Vision – 2504.02477v1.pdf
多模态生成模型：
Simulating the Real World-- A Unified Survey of Multimodal Generative Models – 2503.04641v1.pdf
下一词预测迈向多模态智能：
Next Token Prediction Towards Multimodal Intelligence-- A Comprehensive Survey – 2412.18619v2.pdf
大型多模态模型数据集、应用类别与分类：
Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy – 2412.17759v1.pdf
AI 生成媒体检测（与 MLLM 相关）：
Survey on AI-Generated Media Detection-- From Non-MLLM to MLLM – 2502.05240v2.pdf
视觉问答：
A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems-- The Lifecycle of Knowledge in Visual Reasoning Task – 2504.17547v1.pdf
Visual question answering-- from early developments to recent advances – a survey – 2501.03939v2.pdf
视觉 grounding：
Towards Visual Grounding-- A Survey – 2412.20206v1.pdf

生成式 AI（超越文本到图像/视频）

生成式 AI 用于电影创作：
Generative AI for Film Creation-- A Survey of Recent Advances – 2504.08296v1.pdf
生成式 AI 用于赛璐珞动画：
Generative AI for Cel-Animation-- A Survey – 2501.06250v1.pdf
生成式 AI 用于角色动画：
Generative AI for Character Animation-- A Comprehensive Survey of Techniques, Applications, and Future Directions – 2504.19056v1.pdf
生成式物理 AI：
Generative Physical AI in Vision-- A Survey – 2501.10928v2.pdf
深度生成模型个性化图像生成：
Personalized Image Generation with Deep Generative Models-- A Decade Survey – 2502.13081v1.pdf

扩散模型

扩散模型中的概念擦除：
A Comprehensive Survey on Concept Erasure in Text-to-Image Diffusion Models – 2502.14896v1.pdf
扩散模型中的视觉概念挖掘：
A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models – 2503.13576v1.pdf
扩散模型中的注意力机制：
Attention in Diffusion Model-- A Survey – 2504.03738v1.pdf
扩散模型上的偏好对齐：
Preference Alignment on Diffusion Model-- A Comprehensive Survey for Image Generation and Editing – 2502.07829v1.pdf
文本到图像生成与编辑（扩散模型）：
Text to Image Generation and Editing-- A Survey – 2505.02527v1.pdf

计算机视觉架构与技术（通用）

深度 CNN：
A Comprehensive Survey on Architectural Advances in Deep CNNs-- Challenges, Applications, and Emerging Research Directions – 2503.16546v1.pdf
视觉 Transformers：
Vision Transformers on the Edge-- A Comprehensive Survey of Model Compression and Acceleration Strategies – 2503.02891v3.pdf
Mamba 架构在视觉应用中：
A Survey on Mamba Architecture for Vision Applications – 2502.07161v1.pdf
动态神经网络：
A Survey on Dynamic Neural Networks-- from Computer Vision to Multi-modal Sensor Fusion – 2501.07451v1.pdf
对抗性防御：
A Survey of Adversarial Defenses in Vision-based Systems-- Categorization, Methods and Challenges – 2503.00384v1.pdf
全向视觉中的表示学习：
A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision – 2502.10444v1.pdf
自监督对比学习：
A Survey on Data Curation for Visual Contrastive Learning-- Why Crafting Effective Positive and Negative Pairs Matters – 2502.08134v1.pdf
A Survey on Self-supervised Contrastive Learning for Multimodal Text-Image Analysis – 2503.11101v2.pdf
序数回归：
A Survey on Ordinal Regression-- Applications, Advances and Prospects – 2503.00952v1.pdf
类别无关计数：
A Survey on Class-Agnostic Counting-- Advancements from Reference-Based to Open-World Text-Guided Approaches – 2501.19184v3.pdf
小样本不平衡问题：
A Survey on Small Sample Imbalance Problem-- Metrics, Feature Analysis, and Solutions – 2504.14800v1.pdf
域外检测：
Recent Advances in Out-of-Distribution Detection with CLIP-Like Models-- A Survey – 2505.02448v1.pdf
鲁棒非可转移学习：
Toward Robust Non-Transferable Learning-- A Survey and Benchmark – 2502.13593v2.pdf
工业缺陷检测（基础模型）：
A Survey on Foundation-Model-Based Industrial Defect Detection – 2502.19106v2.pdf
人机交互运动生成：
A Survey on Human Interaction Motion Generation – 2503.12763v2.pdf
图像语义描述模型：
A Preliminary Survey of Semantic Descriptive Model for Images – 2501.08352v1.pdf
利用视觉模型进行时间序列分析：
Harnessing Vision Models for Time Series Analysis-- A Survey – 2502.08869v1.pdf
设备上基于视觉的裂缝检测的量化技术：
Survey of Quantization Techniques for On-Device Vision-based Crack Detection – 2502.02269v1.pdf
视觉输入手势识别：
Survey on Hand Gesture Recognition from Visual Input – 2501.11992v1.pdf
视觉基础模型的解释性：
Explainability for Vision Foundation Models-- A Survey – 2501.12203v1.pdf
组合图像检索：
A Comprehensive Survey on Composed Image Retrieval – 2502.18495v2.pdf
Composed Multi-modal Retrieval-- A Survey of Approaches and Applications – 2503.01334v1.pdf
视觉中的检索增强生成与理解：
Retrieval Augmented Generation and Understanding in Vision-- A Survey and New Outlook – 2503.18016v1.pdf

自动驾驶与机器人

自动驾驶中的世界模型：
A Survey of World Models for Autonomous Driving – 2501.11260v2.pdf
The Role of World Models in Shaping Autonomous Driving-- A Comprehensive Survey – 2502.10498v1.pdf
自动驾驶中的联合感知与预测：
Joint Perception and Prediction for Autonomous Driving-- A Survey – 2412.14088v1.pdf
室内具身 AI 中的语义建图：
Semantic Mapping in Indoor Embodied AI – A Comprehensive Survey and Future Directions – 2501.05750v1.pdf

其他及专业应用

掌纹识别：
Deep Learning in Palmprint Recognition-A Comprehensive Survey – 2501.01166v1.pdf
第一人称视觉：
Challenges and Trends in Egocentric Vision-- A Survey – 2503.15275v2.pdf
体育中的动作评估：
Action Valuation in Sports-- A Survey – 2504.06163v1.pdf
反无人机方法：
Securing the Skies-- A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions – 2504.11967v2.pdf
360 度全景图生成：
A Survey on Text-Driven 360-Degree Panorama Generation – 2502.14799v1.pdf
VideoLLM 基准与评估：
VideoLLM Benchmarks and Evaluation-- A Survey – 2505.03829v1.pdf

本文标签：视觉计算机论文

版权声明：本文标题：2025计算机视觉论文综述汇总内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.roclinux.cn/b/1766500050a3464218.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

Linux大棚 – 不忘初心的技术博客，浮躁时代的安静角落

2025计算机视觉论文综述汇总

更多相关文章

虚拟机里提示计算机内存不足,windows下打开VMware虚拟机时提示内存不足的处理方法...

苹果手机怎样用计算机,苹果手机怎么连接电脑,详细教您怎么使用苹果手机连接电脑...

苹果计算机 win10,苹果怎么装win10苹果装win10详细教程【图文】

计算机第一级开机密码设置,电脑如何设置开机密码 电脑开机密码设置方法

计算机密码设置要求包括哪些内容,电脑开机密码设置方法有哪些

计算机xp bios密码设置方法,电脑开机密码怎么设置_教您各系统设置方法

计算机开机高级设置密码,给电脑设置开机密码

老式计算机如何设置u盘启动,技嘉主板老式bios设置u盘启动教程

win7资源管理器从计算机开始,熟练用Win7电脑从Win7资源管理器入门

win7无法访问win10计算机,win7系统局域网不能访问怎么办

计算机开机显示器不亮,显示器不亮了怎么回事_电脑开机显示器不亮怎么回事-win7之家...

计算机提示网络不可用,Windows电脑系统显示无线网络不可用怎么办?

windows安装程序无法将windows配置为在此计算机的硬件上运行解决方法

进入安全模式后重新启动计算机,进入Win7安全模式方法一：开机按F8键进入 我们在重启或者电脑开机的时候...

计算机正在重新启动,重装 Windows 系统时突然出现“计算机意外地重新启动或遇到错误...”提示怎么办？...

如何更改计算机睿频,设置睿频加速功能在win7中实现加速的步骤

winxp如何创建计算机工作组,windowsxp系统创建或加入计算机工作组的两种方法

探索不同子网的秘密：192.168.1.124与192.168.1.223如何无缝对接

'添加打印机'按钮变冷淡？Win7下激活后台打印服务的实用技巧！

探索系统视觉：解读计算机、磁盘驱动器和文件的图标

发表评论

推荐文章

打造梦幻工作站：天梯CPU排行榜下的PC开发秘籍

在 树莓派4 上 USB 启动_树莓派 usb启动

ROS初始化 sudo rosdep init失败_sudo: rosdep: command not found

由于找不到d3dx9_26.dll文件导致游戏软件无法运行启动问题_极品飞车启动时提示计算机中丢失d3dx9-26dll怎么回事

笔记本电脑作为WiFi热点

热门文章

定义一个名为Vehicles 交通工具 的基类 该类中应包含String类型的成员属性brand 商标 和color 颜色 还应包含成员方法showInfo 显示信息_c++定义一个名为vehicles(交通工具)的基类,该类中应包含string类型的数据

电脑上打开iTunes产生数据库文件和影像数据的一些问题_itunes数据库不完整 红雪

android监听Home键_androidhomelistener

如何DIY一台属于你自己的电脑？_怎样自己diy一台电脑

电脑自动重启是什么原因？重启原因排查和解决办法！_电脑闪退重启 是什么原因

ESP32-VSCODE环境下添加组件，并解决头文件无法找到问题_esp-idf 头文件找不到

笔记本电脑wifi小图标不见了 或者 蓝牙功能消失、电脑开不开机解决方法_华为笔记本不小心把wi-fi驱动删了,如何恢复?

笔记本电脑没有声音？几招恢复声音流畅！_笔记本没声音了如何恢复

电脑声音修复？【图文详解】电脑没有声音？声音异常

一文解密Dism++：卸载驱动的超高效方法

最新文章

一文教会你AIX系统备份：mksysb实用指南

SWF文件备份失败？这些步骤让你轻松搞定

Win10系统备份轻松搞定：掌握captureimage命令的关键技巧

Linux系统安全小贴士：掌握备份与恢复，安心每一天

省时省心！三步完成电脑系统高效备份！

Ubuntu系统维护秘籍：备份步骤详解，保护你的劳动成果！

Linux系统不哭：高效备份与快速恢复方案

Ubuntu系统安全大计，备份技巧大公开

GHOST教程：系统备份和还原，小白也能变成高手！

Linux备份与恢复必修课：SWF文件安全策略从入门到精通

Exploring the Finest Accommodations: A Comprehensive Guide to Ruston LA Hotels

The Enchanting Experience of ScaliniTella NYC: A Culinary Gem in the Heart of Manhattan

Exploring the Exquisite Aloft Chicago O'Hare: A Blend of Modern Luxury and Convenience

A Culinary Journey: Discovering the Finest Dining Experiences in Waco, TX

A Culinary Journey: Discovering the Finest Dining Experiences in Athens, GA

电脑设备管理器在哪里？一次让我抓狂又兴奋的寻找经历

与GWX的持久战：一段关于Windows10升级弹窗的私人记忆

以管理员身份运行：那些年我们追过的权限与踩过的坑

计算机第一级开机密码设置,电脑如何设置开机密码电脑开机密码设置方法

进入安全模式后重新启动计算机,进入Win7安全模式方法一：开机按F8键进入我们在重启或者电脑开机的时候...

在树莓派4 上 USB 启动_树莓派 usb启动

定义一个名为Vehicles 交通工具的基类该类中应包含String类型的成员属性brand 商标和color 颜色还应包含成员方法showInfo 显示信息_c++定义一个名为vehicles(交通工具)的基类,该类中应包含string类型的数据

电脑上打开iTunes产生数据库文件和影像数据的一些问题_itunes数据库不完整红雪

电脑自动重启是什么原因？重启原因排查和解决办法！_电脑闪退重启是什么原因

笔记本电脑wifi小图标不见了或者蓝牙功能消失、电脑开不开机解决方法_华为笔记本不小心把wi-fi驱动删了,如何恢复?