Researchdirections

Adding a Paper Page The GAMMA website lists several top-level research directions as listed in ../research.md. The homepages of each direction are in other .md files in this folder. All folders in this page contain sub-areas of a top-level direction. Members are expected to create pages for your papers to these sub-area folders, and link them in their corresponding top-level .md files. If you intend to add a new top-level research direction (which is unconventional), please consult with your advisor and system-admins.

Metaverse at GAMMA METAVERSE at GAMMA Project Dost User-Centric VR Affective XR Locomotion Human Animation Placement Haptics Search Menu – Lorem ipsum Feugiat tempus veroeros dolor Dolor sit amet Sed vitae justo condimentum Feugiat veroeros Phasellus sed ultricies mi congue Etiam sed consequat Porta lectus amet ultricies

AMCO: Adaptive Multimodal Coupling of Vision and Proprioception for Quadruped Robot Navigation in Outdoor Environments

Abstract We present AMCO, a novel navigation method for quadruped robots that adaptively combines vision-based and proprioception-based perception capabilities. Our approach uses three cost maps: general knowledge map; traversability history map; and current proprioception map; which are derived from a robot’s vision and proprioception data, and couples them to obtain a coupled traversability cost map for navigation. The general knowledge map, encodes terrains semantically segmented from visual sensing, and represents a terrain’s typically expected traversability.

3D-OGSE: Probabilistically Complete Online Safe and Smooth Trajectory Generation using Generalized Shape Expansion in Unknown 3-D Environments

Abstract We present an online motion planning algorithm (3D-OGSE) for generating smooth, collision-free trajectories over multiple planning iterations for a 3-D agent operating in an unknown, obstacle-cluttered, 3-D environment. In each planning iteration, 3D-OGSE constructs an obstacle-free region termed ‘generalized shape’ based on the locally-sensed environment information. A collision-free path is computed by sampling points in the generalized shape and is used to generate a smooth, time-parametrized trajectory by minimizing snap.

3MASSIV: Multilingual, Multimodal and Multi-Aspect dataset of Social Media Short Videos

Abstract We present 3MASSIV, a multilingual, multimodal and multi-aspect, expertly-annotated dataset of diverse short videos extracted from short-video social media platform - Moj. 3MASSIV comprises of 50k short videos (20 seconds average duration) and 100K unlabeled videos in 11 different languages and captures popular short video trends like pranks, fails, romance, comedy expressed via unique audio-visual formats like self-shot videos, reaction videos, lip-synching, self-sung songs, etc. 3MASSIV presents an opportunity for multimodal and multilingual semantic understanding on these unique videos by annotating them for concepts, affective states, media types, and audio language.

A Framework for Active Haptic Guidance Using Robotic Haptic Proxies

Abstract Haptic feedback is an important component of creating an immersive virtual experience. Traditionally, haptic forces are rendered in response to the user’s interactions with the virtual environment. In this work, we explore the idea of rendering haptic forces in a proactive manner, with the explicit intention to influence the user’s behavior through compelling haptic forces. To this end, we present a framework for active haptic guidance in mixed reality, using one or more robotic haptic proxies to influence user behavior and deliver a safer and more immersive virtual experience.

A Repulsive Force Unit for Garment Collision Handling in Neural Networks

Abstract Despite recent success, deep learning-based methods for predicting 3D garment deformation under body motion suffer from interpenetration problems between the garment and the body. To address this problem, we propose a novel collision handling neural network layer called Repulsive Force Unit (ReFU). Based on the signed distance function (SDF) of the underlying body and the current garment vertex positions, ReFU predicts the per-vertex offsets that push any interpenetrating vertex to a collision-free configuration while preserving the fine geometric details.

AADS: Augmented Autonomous Driving Simulation Using Data-Driven Algorithms.

Abstract Simulation systems have become essential to the development and validation of autonomous driving (AD) technologies. The prevailing state-of-the-art approach for simulation uses game engines or high-fidelity computer graphics (CG) models to create driving scenarios. However, creating CG models and vehicle movements (the assets for simulation) remain manual tasks that can be costly and time consuming. In addition, CG images still lack the richness and authenticity of real-world images, and using CG images for training leads to degraded performance.

ABC-Net: Semi-Supervised Multimodal GAN-based Engagement Detection using an Affective, Behavioral and Cognitive Model

– Abstract We present ABC-Net, a novel semi-supervised multimodal GAN framework to detect engagement levels in video conversations based on psychology literature. We use three constructs: behavioral, cognitive, and affective engagement, to extract various features that can effectively capture engagement levels. We feed these features to our semi-supervised GAN network that does regression using these latent representations to obtain the corresponding valence and arousal values, which are then categorized into different levels of engagements.