Projects

Research papers, patents, and engineering work — each with its own page.

WoundNet: Few-Shot Wound Healing Assessment

2022 – 2023

A domain-adaptable few-shot framework distinguishing healer from non-healer wounds through temporal image analysis.

ISBI 2023 | medical-imaging, few-shot-learning, domain-adaptation

Context-Aware Group Anomaly Detection in Education

2022 – 2023

A deep learning framework that monitors student performance and detects anomalies in large-scale active learning courses.

AAAI 2023 | anomaly-detection, education-AI, context-aware, deep-learning

Text-to-Video Generation via Latent Path Construction

2021 – 2022

An early approach to text-to-video generation on realistic datasets, using latent path construction for temporal modeling.

ICPR 2022 | video-generation, text-to-video, temporal-modeling, generative

Multimedia Scene Break Detection

2021 – 2024

Production-scale scene and shot boundary detection system for video content at Tubi.

Patent | video-understanding, temporal-segmentation, production

MMFT-BERT: Multimodal Fusion for Video QA

2019 – 2020

A multimodal fusion transformer with BERT encodings that achieves state-of-the-art results on the TVQA dataset.

Findings of EMNLP 2020 | VQA, vision-and-language, transformers, BERT, video

Task-Focused Attention for Robotic Manipulation

2018 – 2019

Making deep visuomotor policies more robust through task-focused visual attention guided by natural language instructions.

CVPR 2019 | robotics, attention, vision-and-language, visuomotor

Visual Text Correction

2017 – 2018

A vision-and-language task for automatically detecting and correcting falsified words in video descriptions.

ECCV 2018 | vision-and-language, video, text-correction

Multi-Concept Video Retrieval

2016 – 2017

A latent-variable model for retrieving videos by multiple concepts supplied directly by users or inferred from queries.

ACM TOMM 2018 | video-retrieval, latent-variable-models

Video Fill In the Blank

2016 – 2017

Bidirectional LSTMs with spatial-temporal attention that predict missing words in video descriptions.

ICCV 2017 | vision-and-language, video, attention, LSTM

Deep Photo Cropper and Enhancement

2019 – 2020

Dual deep networks for precise cropping and super-resolution enhancement of embedded images.

ICIP 2020 | image-processing, super-resolution, spatial-transformers

ML for Advanced Frequency Management

2022 – 2024

Machine learning techniques for advanced frequency management in streaming media delivery.

Patent | streaming, machine-learning, production