
CVPR2023 Paper Roundup

Contents

  • CVPR2023
    • I. Vision and Language / Multimodal


CVPR2023

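According to official statistics, CVPR 2023 received 9,155 submissions, a 12% increase over last year and a new record. Of these, 2,360 papers were accepted, an acceptance rate of 25.78%. For comparison, last year saw just over 8,100 valid submissions, of which 2,067 were accepted, an acceptance rate of about 25%.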
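The acceptance rates quoted above can be checked with a few lines of arithmetic. This is only a sanity check on the numbers in the text; since last year's submission count is given only as "just over 8,100", the value 8100 below is an assumed lower bound, so the computed 2022 rate is an upper bound rather than the exact figure.

```python
# Figures quoted in the paragraph above.
submitted_2023, accepted_2023 = 9155, 2360
accepted_2022 = 2067

# Assumption: the text only says "just over 8,100" submissions for 2022,
# so 8100 is a lower bound and the resulting rate is an upper bound.
submitted_2022_lower_bound = 8100

rate_2023 = accepted_2023 / submitted_2023
rate_2022_upper = accepted_2022 / submitted_2022_lower_bound

print(f"2023 acceptance rate: {rate_2023:.2%}")               # 25.78%
print(f"2022 acceptance rate: at most {rate_2022_upper:.2%}")  # at most 25.52%
```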

https://cvpr2023.thecvf.com/Conferences/2023/AcceptedPapers

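Below, I sort and categorize the papers in the directions I am interested in, based on keywords. The list is filtered, not exhaustive.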
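A keyword-based first pass like the one described could be sketched as follows. The post does not say how the selection was actually made, so the keyword map and the `group_by_keyword` helper are hypothetical, and the sample titles are taken from the list in this post.

```python
from collections import defaultdict

# Hypothetical keyword map: category name -> lowercase substrings to search for.
KEYWORDS = {
    "Vision and Language / Multimodal": ["vision-language", "multimodal"],
}

def group_by_keyword(titles, keywords=KEYWORDS):
    """Return {category: [titles]} for titles containing any keyword of that category."""
    groups = defaultdict(list)
    for title in titles:
        lowered = title.lower()
        for category, needles in keywords.items():
            if any(needle in lowered for needle in needles):
                groups[category].append(title)
    return dict(groups)

sample = [
    "Task Residual for Tuning Vision-Language Models",
    "Efficient Multimodal Fusion via Interactive Prompting",
    "Align and Attend: Multimodal Summarization with Dual Contrastive Losses",
]
for category, matched in group_by_keyword(sample).items():
    print(category, len(matched))
```

A substring match like this only produces candidates; a manual pass would still be needed to drop papers that match a keyword but fall outside the intended direction.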

I. Vision and Language / Multimodal

Paper title
Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
CREPE: Can Vision-Language Foundation Models Reason Compositionally?
Task Residual for Tuning Vision-Language Models
Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks
VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining
Open-set Fine-grained Retrieval via Prompting Vision-Language Evaluator
Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks
FashionSAP: Symbols and Attributes Prompt for Fine-grained Fashion Vision-Language Pre-training
Accelerating Vision-Language Pretraining with Free Language Modeling
Leveraging per Image-Token Consistency for Vision-Language Pre-training
Position-guided Text Prompt for Vision-Language Pre-training
IFSeg: Image-free Semantic Segmentation via Vision-Language Model
Enhanced Multimodal Representation Learning with Cross-modal KD
Efficient Multimodal Fusion via Interactive Prompting
Best of Both Worlds: Multimodal Contrastive Learning with Tabular and Imaging Data
Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens
Align and Attend: Multimodal Summarization with Dual Contrastive Losses
Multimodal Prompting with Missing Modalities for Visual Recognition