publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2024

  1. arXiv
    DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
    Khan, Zaid, Stengel-Eskin, Elias, Cho, Jaemin, and Bansal, Mohit
    arXiv preprint arXiv:2410.06215 2024
  2. CVPR
    Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement
    Khan, Zaid, Kumar BG, Vijay, Schulter, Samuel, Fu, Yun, and Chandraker, Manmohan
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024
  3. CVPR
    Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering
    Khan, Zaid, and Fu, Yun
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

2023

  1. NeurIPS
    Exploring Question Decomposition for Zero-Shot VQA
    Khan, Zaid, Kumar BG, Vijay, Schulter, Samuel, Chandraker, Manmohan, and Fu, Yun
    In Proceedings of the 36th International Conference on Neural Information Processing Systems (NeurIPS) 2023
  2. CVPR
    Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
    Khan, Zaid, BG, Vijay Kumar, Schulter, Samuel, Yu, Xiang, Fu, Yun, and Chandraker, Manmohan
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023
  3. ICLR
    Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning
    Khan, Zaid, and Fu, Yun
    In The Eleventh International Conference on Learning Representations 2023

2022

  1. ECCV
    Single-Stream Multi-Level Alignment for Vision-Language Pretraining
    Khan, Zaid, BG, Vijay Kumar, Yu, Xiang, Schulter, Samuel, Chandraker, Manmohan, and Fu, Yun
    In European Conference on Computer Vision 2022

2021

  1. ACM MM
    Exploiting BERT for Multimodal Target Sentiment Classification Through Input Space Translation
    Khan, Zaid, and Fu, Yun
    In ACM Conference on Multimedia 2021
  2. ACM FAccT
    One Label, One Billion Faces: Usage and Consistency of Racial Categories in Computer Vision
    Khan, Zaid, and Fu, Yun
    In ACM Conference on Fairness, Accountability, and Transparency 2021
  3. IEEE TMM
    Families In Wild Multimedia (FIW MM): A Multi-Modal Database for Recognizing Kinship
    Robinson, Joseph P., Khan, Zaid, Yin, Yu, Shao, Ming, and Fu, Yun
    arXiv:2007.14509 [cs] 2021

2020

  1. IEEE FG
    Recognizing Families in the Wild (RFIW): The 4th Edition
    Robinson, J. P., Yin, Y., Khan, Z., Shao, M., Xia, S., Stopa, M., Timoner, S., Turk, M. A., Chellappa, R., and Fu, Y.
    In 2020 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020) (FG) 2020