Topics
-
Machine Learning
-
Computer Vision
-
AI in Medicine
-
Dynamical Systems
Deep Learning Theory
Deep Learning Theory encompasses the foundational mathematical principles that underpin modern neural networks and their capabilities. This field investigates nonconvex optimization techniques essential for training deep networks with billions of parameters, despite the theoretical challenges of finding global minima in highly complex loss landscapes. Learning dynamics research explores how different network architectures and training protocols affect convergence, stability, and performance over time. The concept of implicit bias helps explain why overparameterized networks tend to converge to specific solutions despite having infinitely many possible solutions that fit training data. Generalization research addresses the fundamental question of why deep networks perform well on unseen data despite their vast capacity to overfit, developing theoretical frameworks that connect architecture design, optimization algorithms, and statistical learning principles.
Trustworthy Machine Learning
Trustworthy Machine Learning focuses on developing reliable and accountable AI systems that can be safely deployed in critical real-world applications. Interpretability research aims to create models and methods that allow humans to understand how AI systems reach particular decisions, addressing the “black box” problem through techniques like feature attribution, concept-based explanations, and model distillation. Robustness investigations develop algorithms and frameworks that maintain performance under various challenges, including adversarial attacks (subtle input manipulations designed to fool models), distribution shifts (when deployment data differs from training data), and noisy or incomplete inputs that might occur in practical scenarios. Together, these components establish the theoretical and practical foundations needed to develop AI systems that can be trusted with high-stakes decisions in healthcare, transportation, security, and other critical domains.
Parsimonious Representation Learning
Parsimonious Representation Learning focuses on discovering compact, efficient ways to represent complex data while preserving essential information. Matrix factorization techniques decompose high-dimensional data matrices into lower-dimensional components, revealing latent structures and enabling applications like recommendation systems and dimensionality reduction. Subspace clustering methods identify and group data points that lie near lower-dimensional linear or affine subspaces within the ambient space, allowing for more accurate clustering of high-dimensional data with complex geometric structures. Manifold learning approaches discover nonlinear, low-dimensional structures that capture the intrinsic geometry of data, assuming that high-dimensional observations often lie on or near a lower-dimensional manifold, thus enabling more effective visualization, compression, and feature extraction while respecting the underlying data geometry.
Continual Learning
Continual Learning addresses the challenge of developing machine learning systems that can acquire knowledge incrementally over time without forgetting previously learned information—a capability that comes naturally to humans but poses significant difficulties for artificial systems. This field explores strategies to overcome catastrophic forgetting, where neural networks tend to overwrite earlier knowledge when trained on new tasks, through techniques like regularization methods that identify and protect important parameters, replay mechanisms that strategically revisit past experiences, and architectural approaches that allocate specific network components to different tasks. Continual learning research spans theoretical investigations of knowledge transfer and interference, algorithmic innovations for balancing stability and plasticity, and practical applications in scenarios where models must adapt to changing environments or sequentially presented tasks, such as in robotics, personalized recommendation systems, and healthcare monitoring.
Optimization
Optimization research in machine learning develops mathematical frameworks and algorithms to efficiently find optimal parameters or solutions across diverse learning problems. Optimization on manifolds extends traditional optimization techniques to handle constraints where solutions must lie on curved mathematical spaces, enabling applications in computer vision, robotics, and scientific computing. Optimization for learning focuses on developing specialized algorithms tailored to the unique challenges of training machine learning models, addressing issues like saddle points, local minima, and the interplay between optimization dynamics and generalization performance. The intersection of optimization and dynamical systems provides theoretical tools to analyze convergence properties and training trajectories, treating optimization algorithms as discrete or continuous dynamical systems. Distributed optimization techniques enable training models across multiple machines or devices while minimizing communication costs, becoming increasingly important for large-scale learning problems and federated learning scenarios where data privacy is paramount.
3D Vision
3D Vision research focuses on enabling computers to understand and reconstruct the three-dimensional world from visual data. Structure from Motion techniques recover both camera poses and 3D scene geometry from sequences of 2D images by identifying corresponding points across frames and solving geometric optimization problems. Motion segmentation methods separate multiple moving objects in dynamic scenes, distinguishing independent motion patterns from camera-induced apparent motion, which is crucial for autonomous navigation and video analysis. 3D scene analysis encompasses a broader set of techniques for understanding spatial relationships, object arrangements, and scene semantics in three dimensions, including depth estimation, volumetric reconstruction, and scene parsing that enables applications ranging from augmented reality and robotics to architectural modeling and autonomous driving systems.
Video
Video research in computer vision addresses the challenges of analyzing and generating temporal visual content with coherent spatial-temporal relationships. Video generation techniques create realistic or stylized video sequences using generative models that capture both appearance and motion dynamics, with applications in entertainment, simulation, and data augmentation. Action recognition methods identify human activities in video by modeling temporal patterns and motion cues, while action detection further localizes when and where specific activities occur within longer, untrimmed videos. Action segmentation extends these capabilities by precisely delineating the temporal boundaries between different activities in continuous video streams, breaking complex sequences into meaningful segments. Together, these video understanding technologies enable applications ranging from surveillance and sports analytics to human-computer interaction and automated video indexing.
Image
Image-focused computer vision research develops algorithms for understanding and manipulating still visual content across various levels of abstraction. Image generation techniques create novel visual content through generative adversarial networks, diffusion models, and other approaches that model the underlying distribution of natural or domain-specific images. Object detection methods identify and localize multiple objects within images, providing bounding boxes and class labels that enable scene understanding for applications like autonomous driving and retail analytics. Pose estimation techniques recover the spatial configuration of articulated objects, particularly human bodies or hands, enabling applications in animation, gesture recognition, and human activity analysis. Object and semantic segmentation approaches partition images into semantically meaningful regions by classifying each pixel, providing fine-grained scene decomposition that supports applications ranging from medical image analysis to computational photography and augmented reality.
Vision and Language
Vision and Language research bridges visual perception and natural language understanding to create systems that can reason about images and text in an integrated manner. Visual Question Answering develops models that can respond to natural language questions about image content, requiring multi-modal reasoning that connects visual features with linguistic concepts. Visual Grounding techniques locate objects or regions in images based on natural language descriptions, enabling applications like interactive image editing and robotic manipulation guided by verbal commands. Scene interpretation methods extract structured representations of visual scenes, identifying objects, their attributes, and their relationships to support higher-level reasoning. Image captioning systems generate natural language descriptions of visual content, requiring both visual understanding and linguistic generation capabilities to produce relevant, accurate, and contextually appropriate textual summaries of images for applications in accessibility, content indexing, and multimodal communication.
Biomedical Image Analysis
Biomedical Image Analysis employs computer vision and machine learning techniques to interpret and extract clinically relevant information from medical imaging data. Diffusion MRI analysis methods process specialized magnetic resonance signals to map tissue microstructure and neural fiber pathways in the brain, enabling studies of connectivity patterns in healthy development and neurological disorders. Explainable radiology research develops interpretable AI systems for medical image interpretation that not only provide diagnostic predictions but also justify their conclusions with visual evidence and reasoning that clinicians can verify and trust. Microscopy image analysis techniques automatically process cellular and tissue images at various scales, enabling quantification of morphological features, tracking of cellular dynamics, and identification of pathological patterns that support both clinical diagnostics and basic biological research, ultimately enhancing precision medicine through quantitative biomarkers and computational pathology.
Computer Vision for Health
Computer Vision for Health applies visual understanding technologies to healthcare challenges, creating systems that monitor, assess, and support human wellbeing. Surgical activity analysis techniques automatically recognize phases, gestures, and instrument usage during medical procedures through video analysis, enabling applications in surgical training, workflow optimization, and intraoperative decision support. Movement diagnosis systems use computer vision to quantify and characterize motor behaviors relevant to neurological and developmental conditions, providing objective assessment tools for conditions like autism spectrum disorder, where subtle movement patterns may serve as early biomarkers. Similar techniques support therapeutic monitoring in Tourette syndrome by quantifying tic frequency and severity, while rehabilitation applications track patient movements during physical therapy to provide feedback on exercise quality, measure progress over time, and personalize treatment protocols. These vision-based health systems reduce assessment subjectivity and increase accessibility of specialized healthcare expertise.
Hybrid Systems
Hybrid Systems research addresses dynamical systems that combine continuous evolution with discrete state transitions, creating mathematical frameworks for systems that switch between different operating modes. Observability studies in this domain investigate when and how the internal states of hybrid systems can be reconstructed from external measurements, which is crucial for monitoring and controlling complex systems like power grids with switching topologies or robotic systems with contact dynamics. Identification methods develop techniques to construct mathematical models of hybrid systems from experimental data, learning both the continuous dynamics within each mode and the discrete switching logic between modes. These theoretical foundations support applications in cyber-physical systems, including autonomous vehicles that switch between different control laws, smart manufacturing systems with multiple operating regimes, and biomedical devices like artificial pancreas systems that must adjust their behavior based on discrete physiological states.
Multi-agent Systems
Multi-agent Systems research studies collections of autonomous entities that interact with each other and their environment, developing frameworks for coordination, competition, and emergent behavior. Pursuit-evasion games model strategic interactions between pursuing and evading agents, addressing questions of optimal strategies, capture conditions, and equilibrium solutions with applications in security, robotics, and computational modeling of biological systems. Consensus on manifolds extends traditional agreement protocols to scenarios where agents must coordinate on curved mathematical spaces like rotation groups or spheres, which arise naturally in applications like satellite attitude synchronization, distributed camera networks, and coordinated motion planning. This field combines ideas from game theory, control theory, and distributed computing to develop theoretical guarantees and practical algorithms for emerging technologies like drone swarms, autonomous vehicle teams, and distributed robotic systems that must cooperatively solve complex tasks.
Linear Systems
Linear Systems theory provides fundamental tools for analyzing and designing systems governed by linear differential or difference equations, forming the foundation for many control and signal processing applications. Geometric approaches examine system properties through the lens of linear subspaces and transformations, revealing intrinsic structural features that inform controller design and system analysis. Sparsity considerations address scenarios where system matrices have many zero entries due to physical constraints or limited interactions between components, leading to computationally efficient algorithms and insights for large-scale systems like power networks or neural connectivity models. Observability research investigates conditions under which a system’s internal states can be reconstructed from measured outputs, including minimal sensor placement, robustness to noise, and reconstruction algorithms that enable state estimation and monitoring in applications ranging from autonomous vehicles to industrial process control and infrastructure management.
Publication by Topics
2022
Reverse Engineering $ell_p$ attacks: A block-sparse optimization approach with recovery guarantees Proceedings Article
In: International Conference on Machine Learning, 2022.
Adaptive sparse reconstruction for lensless digital holography via PSF estimation and phase retrieval Journal Article
In: Optics Express, pp. 33433-33448, 2022.
2021
What Is the Largest Sparsity Pattern That Can Be Recovered by 1-Norm Minimization? Journal Article
In: IEEE Transactions on Information Theory, vol. 67, no. 5, pp. 3060-3074, 2021.
2020
CompactNets: Compact Hierarchical Compositional Networks for Visual Recognition Journal Article
In: Computer Vision and Image Understanding, vol. 191, 2020.
Nonconvex robust low-rank matrix recovery Journal Article
In: SIAM Journal on Optimization, vol. 30, no. 1, pp. 660-686, 2020.
2019
Structured low-rank matrix factorization: Global optimality, algorithms, and applications Journal Article
In: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 6, pp. 1468-1482, 2019.
2018
Joint Spatial-Angular Sparse Coding for Diffusion MRI with Separable Dictionaries Journal Article
In: Medical Image Analysis, vol. 48, pp. 25-42, 2018.
Theoretical Analysis of Sparse Subspace Clustering with Missing Entries Journal Article
In: arXiv:1801.00393, 2018.
On Geometric Analysis of Affine Sparse Subspace Clustering Journal Article
In: IEEE Journal on Selected Topics in Signal Processing, vol. 12, no. 6, pp. 1520–1533, 2018.
Dropout as a Low-Rank Regularizer for Matrix Factorization Proceedings Article
In: International Conference on Artificial Intelligence and Statistics, pp. 435–444, 2018.
2017
Structured Low-Rank Matrix Factorization: Global Optimality, Algorithms, and Applications Journal Article
In: CoRR, vol. abs/1708.07850, 2017.
Efficient Reconstruction of Holographic Lens-Free Images by Sparse Phase Recovery Proceedings Article
In: Medical Image Computing and Computer Assisted Intervention, pp. 109–117, Springer 2017.
Structured Sparse Subspace Clustering: A Joint Affinity Learning and Subspace Clustering Framework Journal Article
In: IEEE Transactions on Image Processing, vol. 26, no. 6, pp. 2988-3001, 2017.
($k, q$)-Compressed Sensing for dMRI with Joint Spatial-Angular Sparsity Prior Proceedings Article
In: MICCAI Workshop on Computational Diffusion MRI, 2017.
Joint Spatial-Angular Sparse Coding for Diffusion MRI with Separable Dictionaries Journal Article
In: ArXiv, 2017.
An Analysis of Dropout for Matrix Factorization Journal Article
In: arXiv preprint arXiv:1710.03487, 2017.
Removal of the Twin Image Artifact in Holographic Lens-Free Imaging by Sparse Dictionary Learning and Coding Proceedings Article
In: IEEE International Symposium on Biomedical Imaging, pp. 741–744, 2017.
Blood Cell Detection and Counting in Holographic Lens-free Imaging by Convolutional Sparse Dictionary Learning and Coding Proceedings Article
In: IEEE International Symposium on Biomedical Imaging, pp. 650–653, 2017.
2016
System and method for reconstruction of holographic lens-free images by multi-depth sparse phase recovery Miscellaneous
US Patent WO2018085655A1, 2016.
System and Method for Removal of Twin Image Artifact in Holographic Lens-free Imaging by Sparse Dictionary Learning Miscellaneous
US Patent EP3318932A1, 2016.
A Structured Sparse plus Structured Low-Rank Framework for Subspace Clustering and Completion Journal Article
In: IEEE Transactions on Signal Processing, vol. 64, no. 24, pp. 6557-6570, 2016.
Spatial-Angular Sparse Coding for HARDI Proceedings Article
In: Medical Image Computing and Computer Assisted Intervention, pp. 475–483, Springer 2016.
Scalable Sparse Subspace Clustering by Orthogonal Matching Pursuit Proceedings Article
In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3918-3927, 2016.
2015
Structured Sparse Subspace Clustering: A Unified Optimization Framework Proceedings Article
In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 277-286, 2015.
On Sufficient Conditions for Affine Sparse Subspace Clustering Proceedings Article
In: Signal Processing with Adaptive Sparse Structured Representations, 2015.
Learning Shared, Discriminative, and Compact Representations for Visual Recognition Journal Article
In: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, iss. 99, no. 11, pp. 2218-2231, 2015.
Latent Space Sparse and Low-rank Subspace Clustering Journal Article
In: IEEE Journal of Selected Topics in Signal Processing, vol. 9, no. 4, pp. 691-701, 2015.
System and method for structured low-rank matrix factorization: optimality, algorithm, and applications to image processing Miscellaneous
US Patent US20160371563A1, 2015.
Subspace-Sparse Representation Journal Article
In: Arxiv, vol. abs/1507.01307, 2015.
Geometric Conditions for Subspace-Sparse Recovery Proceedings Article
In: International Conference on Machine Learning, pp. 1585–1593, 2015.
Geometric Conditions for Subspace-Sparse Recovery Proceedings Article
In: Signal Processing with Adaptive Sparse Structured Representations, 2015.
Scalable Sparse Subspace Clustering by Orthogonal Matching Pursuit Journal Article
In: Arxiv, vol. abs/1507.01238, 2015.
System and method for structured low-rank matrix factorization: optimality, algorithm, and applications to image processing Miscellaneous
US Patent US20160371563A1, 2015.
Global Optimality in Tensor Factorization, Deep Learning, and Beyond Journal Article
In: arXiv preprint arXiv:1506.07540, vol. abs/1506.07540, 2015.
2014
Segmentation of High Angular Resolution Diffusion MRI using Sparse Riemannian Manifold Clustering Journal Article
In: IEEE Transactions on Medical Imaging, vol. 33, no. 2, pp. 301-317, 2014.
Kernel Sparse Subspace Clustering Proceedings Article
In: IEEE International Conference on Image Processing, pp. 2849-2853, 2014.
Sparse Dictionaries for Semantic Segmentation Proceedings Article
In: European Conference on Computer Vision, 2014.
Structured Low-Rank Matrix Factorization: Optimality, Algorithm, and Applications to Image Processing Proceedings Article
In: International Conference on Machine Learning, pp. 2007–2015, 2014.
2013
Sparse Subspace Clustering: Algorithm, Theory, and Applications Journal Article
In: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, no. 11, pp. 2765-2781, 2013.
Latent Space Sparse Subspace Clustering Proceedings Article
In: IEEE International Conference on Computer Vision, pp. 225-232, 2013.
2012
Block-Sparse Recovery via Convex Optimization Journal Article
In: IEEE Transactions on Signal Processing, vol. 60, no. 8, pp. 4094-4107, 2012.
2011
Sparse Riemannian Manifold Clustering for HARDI Segmentation Proceedings Article
In: IEEE International Symposium on Biomedical Imaging, pp. 839–842, 2011.
2007
Projective Factorization of Multiple Rigid-Body Motions Proceedings Article
In: IEEE Conference on Computer Vision and Pattern Recognition, 2007.
2002
A Factorization Method for 3D Multi-body motion estimation and segmentation Technical Report
UC Berkeley no. UCB/ERL M02/3, 2002.
