Xueyan Zou
PhD Student at UW-Madison
Verified email at wisc.edu - Homepage
Title
Cited by
Year
Segment everything everywhere all at once
X Zou*, J Yang*, H Zhang*, F Li*, L Li, J Wang, L Wang, J Gao^, YJ Lee^
arXiv preprint arXiv:2304.06718, 2023
Cited by: 220; Year: 2023
Generalized Decoding for Pixel, Image, and Language
X Zou*, ZY Dou*, J Yang*, Z Gan, L Li, C Li, X Dai, H Behl, J Wang, ...
CVPR 2023, 2022
Cited by: 142; Year: 2022
Delving deeper into anti-aliasing in convnets
X Zou, F Xiao, Z Yu, YJ Lee
BMVC 2020, IJCV, 2020
Cited by: 91; Year: 2020
A simple framework for open-vocabulary segmentation and detection
H Zhang, F Li, X Zou, S Liu, C Li, J Yang, L Zhang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 65; Year: 2023
Semantic-sam: Segment and recognize anything at any granularity
F Li, H Zhang, P Sun, X Zou, S Liu, J Yang, C Li, L Zhang, J Gao
arXiv preprint arXiv:2307.04767, 2023
Cited by: 62; Year: 2023
Progressive Temporal Feature Alignment Network for Video Inpainting
X Zou, L Yang, D Liu, YJ Lee
CVPR 2021, 16448-16457, 2021
Cited by: 51; Year: 2021
Set-of-mark prompting unleashes extraordinary visual grounding in gpt-4v
J Yang, H Zhang, F Li, X Zou, C Li, J Gao
arXiv preprint arXiv:2310.11441, 2023
Cited by: 46; Year: 2023
Llava-plus: Learning to use tools for creating multimodal agents
S Liu, H Cheng, H Liu, H Zhang, F Li, T Ren, X Zou, J Yang, H Su, J Zhu, ...
arXiv preprint arXiv:2311.05437, 2023
Cited by: 26; Year: 2023
Llava-grounding: Grounded visual chat with large multimodal models
H Zhang, H Li, F Li, T Ren, X Zou, S Liu, S Huang, J Gao, L Zhang, C Li, ...
arXiv preprint arXiv:2312.02949, 2023
Cited by: 6; Year: 2023
Visual In-Context Prompting
F Li, Q Jiang, H Zhang, T Ren, S Liu, X Zou, H Xu, H Li, C Li, J Yang, ...
arXiv preprint arXiv:2311.13601, 2023
Cited by: 2; Year: 2023
End-to-end instance edge detection
X Zou, H Liu, YJ Lee
arXiv preprint arXiv:2204.02898, 2022
Cited by: 2; Year: 2022
Interfacing Foundation Models' Embeddings
X Zou, L Li, J Wang, J Yang, M Ding, Z Yang, F Li, H Zhang, S Liu, ...
arXiv preprint arXiv:2312.07532, 2023
Cited by: 1; Year: 2023
Articles 1–12