publications

* denotes equal contribution

An up-to-date list is available on Google Scholar.

2025

  1. ICCV’25
    Learning Precise Affordances from Egocentric Videos for Robotic Manipulation
    Gen Li, Nikolaos Tsagkas, Jifei Song, Ruaridh Mon-Williams, Sethu Vijayakumar, and 2 more authors
    In IEEE/CVF International Conference on Computer Vision, 2025
  2. ICCV’25
    Principles of Visual Tokens for Efficient Video Understanding
    Xinyue Hao, Gen Li, Shreyank N Gowda, Robert B Fisher, Jonathan Huang, and 2 more authors
    In IEEE/CVF International Conference on Computer Vision, 2025
  3. IROS’25
    Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts
    Yizhou Huang, Fan Yang, Guoliang Zhu, Gen Li, Hao Shi, and 4 more authors
    In International Conference on Intelligent Robots and Systems, 2025
  4. NMI
    Embodied Large Language Models Enable Robots to Complete Complex Tasks in Unpredictable Environments
    Ruaridh Mon-Williams, Gen Li, Ran Long, Wenqian Du, and Chris Lucas
    Nature Machine Intelligence, 2025

2024

  1. ECCVW’24
    Watt for what: Rethinking deep learning’s energy-performance relationship
    Shreyank N Gowda, Xinyue Hao, Gen Li, Shashank Narayana Gowda, Xiaobo Jin, and 1 more author
    In European Conference on Computer Vision Workshop, 2024
  2. CVPR’24
    One-Shot Open Affordance Learning with Foundation Models
    Gen Li, Deqing Sun, Laura Sevilla-Lara, and Varun Jampani
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

  1. IJCNN’23
    Referenceless User Controllable Semantic Image Synthesis
    Jonghyun Kim, Gen Li, and Joongkyu Kim
    In International Joint Conference on Neural Networks, 2023
  2. CVPR’23
    LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding
    Gen Li, Varun Jampani, Deqing Sun, and Laura Sevilla-Lara
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2021

  1. CVPR’21
    Adaptive Prototype Learning and Allocation for Few-Shot Segmentation
    Gen Li, Varun Jampani, Laura Sevilla-Lara, Deqing Sun, Jonghyun Kim, and 1 more author
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021
  2. BMVC’21
    SuperStyleNet: Deep Image Synthesis with Superpixel Based Style Encoder
    Jonghyun Kim, Gen Li, Cheolkon Jung, and Joongkyu Kim
    In British Machine Vision Conference, 2021
  3. PR
    Weakly-supervised temporal attention 3D network for human action recognition
    Jonghyun Kim, Gen Li, Inyong Yun, Cheolkon Jung, and Joongkyu Kim
    Pattern Recognition, 2021
  4. NC
    Edge and identity preserving network for face super-resolution
    Jonghyun Kim, Gen Li, Inyong Yun, Cheolkon Jung, and Joongkyu Kim
    Neurocomputing, 2021

2020

  1. Access
    Depth-Wise Asymmetric Bottleneck With Point-Wise Aggregation Decoder for Real-Time Semantic Segmentation in Urban Scenes
    Gen Li, Shenlu Jiang, Inyong Yun, Jonghyun Kim, and Joongkyu Kim
    IEEE Access, 2020

2019

  1. BMVC’19
    DABNet: Depth-wise asymmetric bottleneck for real-time semantic segmentation
    Gen Li and Joongkyu Kim
    In British Machine Vision Conference, 2019