Post
1581
πππ New Research Alert - ICCV 2025 (Oral)! ππ€π
π Title: Understanding Co-speech Gestures in-the-wild π
π Description: JEGAL is a tri-modal model that learns from gestures, speech and text simultaneously, enabling devices to interpret co-speech gestures in the wild.
π₯ Authors: @sindhuhegde , K R Prajwal, Taein Kwon, and Andrew Zisserman
π Conference: ICCV, 19 β 23 Oct, 2025 | Honolulu, Hawai'i, USA πΊπΈ
π Paper: Understanding Co-speech Gestures in-the-wild (2503.22668)
π Web Page: https://www.robots.ox.ac.uk/~vgg/research/jegal
π Repository: https://github.com/Sindhu-Hegde/jegal
πΊ Video: https://www.youtube.com/watch?v=TYFOLKfM-rM
π ICCV-2023-25-Papers: https://github.com/DmitryRyumin/ICCV-2023-25-Papers
π Added to the Human Modeling Section: https://github.com/DmitryRyumin/ICCV-2023-25-Papers/blob/main/sections/2025/main/human-modeling.md
π More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin
π Keywords: #CoSpeechGestures #GestureUnderstanding #TriModalRepresentation #MultimodalLearning #AI #ICCV2025 #ResearchHighlight
π Title: Understanding Co-speech Gestures in-the-wild π
π Description: JEGAL is a tri-modal model that learns from gestures, speech and text simultaneously, enabling devices to interpret co-speech gestures in the wild.
π₯ Authors: @sindhuhegde , K R Prajwal, Taein Kwon, and Andrew Zisserman
π Conference: ICCV, 19 β 23 Oct, 2025 | Honolulu, Hawai'i, USA πΊπΈ
π Paper: Understanding Co-speech Gestures in-the-wild (2503.22668)
π Web Page: https://www.robots.ox.ac.uk/~vgg/research/jegal
π Repository: https://github.com/Sindhu-Hegde/jegal
πΊ Video: https://www.youtube.com/watch?v=TYFOLKfM-rM
π ICCV-2023-25-Papers: https://github.com/DmitryRyumin/ICCV-2023-25-Papers
π Added to the Human Modeling Section: https://github.com/DmitryRyumin/ICCV-2023-25-Papers/blob/main/sections/2025/main/human-modeling.md
π More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin
π Keywords: #CoSpeechGestures #GestureUnderstanding #TriModalRepresentation #MultimodalLearning #AI #ICCV2025 #ResearchHighlight