Xuri Ge (葛旭日)[CV]

Best way to reach me → Email: xurigexmu@gmail.com

I am currently a final year Ph.D student at School of Computing Science, University of Glasgow, Scotland, UK and a member of the GAIR-Lab in Information, Data and Analysis (IDA) group. My principal supervisor is Prof. Joemon M Jose and second supervisor is Dr. Gerardo Aragon Camarasa. I received my master's degree from Xiamen University in 2020. My advisors are Prof. Rongrong Ji and Minghui Shi. During my master, I worked in the Laboratory of MAC, Artificial Intelligence Department, School of Informatics, Xiamen University, China.
More recently, my main research attention in computer vision, natural language processing and multimedia, mainly including cross-modal retrieval, facial action unit detection, image captioning, medical image analysis, etc.

  • Multi-modal Representation Learning
  • Computer Vision (CV) and Natural Language Processing (NLP)
  • Multi-modal Information Retrieval

  • --Latest News--

    Publications

      Published:

    1. Xuri Ge, Junchen Fu, Fuhai Chen, Shan An, Nicu Sebe, Joemon M Jose
      Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning. [pdf]
      The 32nd ACM Multimedia Conference (ACM MM24), 2024. (Core Rank A*, CCF-A)
    2. Junchen Fu, Xuri Ge, Xin Xin, Alexandros Karatzoglou, Ioannis Arapakis, Jie Wang, Joemon M. Jose
      IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT. [pdf] [code]
      the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024. (Core Rank A*, CCF-A)
    3. Zijun Long, Xuri Ge, Richard Mccreadie, Joemon M. Jose
      CFIR: Fast and Effective Long-Text To Image Retrieval for Large Corpora. [pdf] [code]
      the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024. (Core Rank A*, CCF-A)
    4. Fuxiang Tao, Xuri Ge, Wei Ma, Anna Esposito, Alessandro Vinciarelli.
      Cross-Data Multilevel Attention for Depression Detection: Analyzing the Interplay Between Read and Spontaneous Speech. [pdf]
      IEEE International Conference on Bioinformatics and Biomedicine(IEEE BIBM 2024), 2024.[CCF B]
    5. Xuri Ge, Songpei Xu, Fuhai Chen, Jie Wang, Guoxin Wang, Shan An, Joemon M. Jose
      3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting. [J] [pdf] [code]
      Information Processing and Management (IP&M), 2024. (IF=8.6, JCR Q1)
    6. Jie Wang, Alexandros Karatzoglou, Ioannis Arapakis, Xin Xin, Xuri Ge, Jose M. Joemon
      Sparks of Surprise: Multi-objective Recommendations with Hierarchical Decision Transformers for Diversity, Novelty, and Serendipity. [pdf]
      33rd ACM International Conference on Information and Knowledge Management (CIKM), 2024. (Core Rank A)
    7. Tong Shi, Xuri Ge, Joemon M Jose, Nicolas Pugeault, Paul Henderson
      Detail-Enhanced Intra-and Inter-modal Interaction for Audio-Visual Emotion Recognition. [Arxiv]
      27th International Conference on Pattern Recognition (ICPR), 2024.
    8. Songpei Xu, Xuri Ge, Chaitanya Kaul, Roderick Murray-Smith.
      HpEIS: Learning Hand Pose Embeddings for Multimedia Interactive Systems. [pdf coming]
      IEEE Conference on Multimedia Expo (ICME), 2024. [CORE Rank A]
    9. Xuri Ge, Joemon M. Jose, Songpei Xu, Xiao Liu, Hu Han
      MGRR-Net: Multi-level Graph Relational Reasoning Network for Facial Action Units Detection. [J] [pdf]
      ACM Transactions on Intelligent Systems and Technology (TIST), 2024. (IF=7.2, JCR Q1)
    10. Jie Wang, Bansal Kanha, Arapakis Ioannis, Xuri Ge, Joemon M. Jose
      Empowering Legal Citation Recommendation via Efficient Struction-Tuning of Pre-trained Language Models. [pdf]
      The 46th European Conference on Information Retrieval (ECIR), 2024. [CORE Rank A]
    11. Xuri Ge, Joemon M. Jose, Pengcheng Wang, Arunachalam Iyer, Xiao Liu, Hu Han
      ALGRNet: Multi-Relational Adaptive Facial Action Unit Modelling for Face Representation and Relevant Recognitions. [J] [pdf]
      IEEE Transactions on Biometrics, Behavior, and Identity Science (TBIOM), 2023.
    12. Xuri Ge, Fuhai Chen, Songpe Xu, Fuxiang Tao, Joemon M. Jose.
      Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval. [pdf]
      Winter Conference on Applications of Computer Vision (WACV2023), 2023.[CORE Rank A]
    13. Fuxiang Tao, Xuri Ge, Wei Ma, Anna Esposito, Alessandro Vinciarelli.
      Multi-Local Attention for Speech-based Depression Detection. [pdf]
      IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023.[CCF B]
    14. Songpei Xu, Chaitanya Kaul, Xuri Ge, Roderick Murray-Smith.
      Continuous Interaction with a Smart Speaker via Low-dimensional Embeddings of Dynamic Hand Pose. [pdf]
      IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023.[CCF B]
    15. Xuri Ge, Fuhai Chen, Joemon M. Jose, Zhilong Ji, Zhongqin Wu, Xiao Liu.
      Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval. [pdf]
      ACM International Conference on Multimedia (ACM MM), 2021. [CORE Rank A*]
    16. Xuri Ge*✉, Pengcheng Wang*, Hu Han, Joemon M. Jose, Zhonglong Ji, Zhongqin Wu, Xiao Liu.
      Local Global Relational Network for Facial Action Units Recognition. (Long-paper, Full Oral Report) [pdf]
      IEEE International Conference on Automatic Face and Gesture Recognition (FG), 2021. [CORE Rank B] [TH-CPL Rank B]
    17. Fuhai Chen, Rongrong Ji, Jiayi Ji, Xiaoshuai Sun, Baochang Zhang, Xuri Ge, Yongjian Wu, Feiyue Huang, Yan Wang.
      Variational Structured Semantic Inference for Diverse Image Captioning. [pdf] [BibTex]
      The 33th Conference on Neural Information Processing Systems (NeurIPS). 2019. [CORE Rank A*]
    18. Xuri Ge , Fuhai Chen, Chen Shen, Rongrong Ji
      Colloquial Image Captioning. (Oral Report) [pdf] [BibTex]
      IEEE International Conference on Multimedia and Expo (ICME), 2019. [CORE Rank A]

      Pre-print:

    19. Xuri Ge, Fuhai Chen, Songpei Xu, Fuxiang Tao, Jie Wang and Joemon M. Jose
      Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching. [Arxiv]
      Under review, 2024.
    20. Fuhai Chen#, Xuri Ge#, Xiaoshuai Sun, Yue Gao, Jianzhuang Liu, Fufeng Chen, Wenjie Li
      Differentiated Relevances Embedding for Group-based Referring Expression Comprehension. [Arxiv]
      Under review, 2023.
    21. Fuxiang Tao, Wei Ma, Xuri Ge, Anna Esposito, Alessandro Vinciarelli
      The Relationship Between Speech Features Changes When You Get Depressed: Feature Correlations for Improving Speed and Performance of Depression Detection. [Arxiv]
      Under review, 2023.
    22. Fuhai Chen, Rongrong Ji, Chengpeng Dai, Xuri Ge, Shengchuang Zhang, Xiaojing Ma, Yue Gao.
      Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation. [Arxiv]
    23. Xuri Ge,Xiaoshuai Sun, Zhilong Ji, Pengcheng Wang, Xiao Liu, Zhongqin Wu
      The DenseCap-Guided Attention Network For Image-Text Matching. [pdf coming]
      Under review.

      Patent:

    24. Xuri Ge, Zhilong Ji, Xiao Liu
      检索方法、电子设备及计算机可读介质.
      Published: CN 112287159 A; Num: 202011506349.4
    25. Xuri Ge, Zhilong Ji, Xiao Liu
      弹幕生成方法、装置、电子设备及计算机存储介质.
      Published: CN 112016573 A; Num: 202011112941.6

    Activities

    • Conference Reviewer: NeurIPS, ICLR, ACM Multimedia, WWW, CIKM, WACV, AISTATS, ICME, BMVC, ECIR, FG, ICASSP, etc.
    • Journal Reviewer: International Journal of Computer Vision(IJCV), IEEE TRANSACTIONS ON MULTIMEDIA (TMM), Transactions on Knowledge and Data Engineering(TKDE), Pattern Recognition(PR), The ACM Transactions on Information Systems (TOIS), Information Processing and Management(IP&M), Multimedia Systems, etc.
    • 中国图象图形学学会(CSIG)会员.
    • Organizing committee of 3D Multimedia Analytics, Search and Generation (3DMM 2024) in ICME 2024 workshop (Link).
    • Organizing committee of AutoGen-CDR19 challenge in MICCAI 2019 (Link).
    • 1st Prize (team name: MAC-Group), award on Workshop of Automatic Generation of Cardiovascular Diagnostic Report, The 22th Medical Image Computing Computer Assisted Intervention (MICCAI 2019), 2019.
    • 5th Prize (team name: SenseTime, method name: GraphLayout), award on ICDAR 2019 Robust Reading Challenge.

    Working Experiences

    Msc Supervision, School of Computing Science, University of Glasgow Tutor, School of Computing Science, University of Glasgow, UK
    • Summer 2021, Teaching assistant of “Text as Data (Master)”, University of Glasgow.
    • Summer 2021, Teaching assistant of “Web Science (M.)”, University of Glasgow.
    • Spring 2022, Tutor of “Text as Data (M.)”, University of Glasgow.
    • Spring 2022, Tutor of “Web Science (M.)”, University of Glasgow.
    • Spring 2022, Tutor of “Information Visualisation (M.)”, University of Glasgow.
    • Winter 2022, Tutor of “Machine Learning (M.)”, University of Glasgow.
    • Winter 2022, Tutor of “Computer Vision (High-level)”, University of Glasgow.
    • Spring 2023, Tutor of “Text as Data (M.)”, University of Glasgow.
    • Spring 2023, Tutor of “Web Science (M.)”, University of Glasgow.
    • Spring 2024, Tutor of “Web Science (M.)”, University of Glasgow.
    Company Researcher
    • 2020.07 - 2021.04, Computer Vision Researcher, TAL.
    • 2019.03 - 2019.07, Research Intern, SenseTime.
    Teaching assistant, School of Informatics, Xiamen University, China
    • Spring 2018, Teaching assistant of “Introduction to Artificial Intelligence”, Xiamen University.

    Awards

    • China Scholarship Council (CSC) Scholarships, 2021.01-2025.01
    • Xiamen University Scholarship, 2017-2020