Topics

Deep Learning Theory

Deep Learning Theory encompasses the foundational mathematical principles that underpin modern neural networks and their capabilities. This field investigates the nonconvex optimization techniques used to train deep networks with billions of parameters, despite the theoretical difficulty of finding global minima in highly complex loss landscapes. Learning dynamics research explores how different network architectures and training protocols affect convergence, stability, and performance over time. The concept of implicit bias helps explain why training overparameterized networks tends to converge to particular solutions even though infinitely many parameter settings fit the training data equally well. Generalization research addresses the fundamental question of why deep networks perform well on unseen data despite their vast capacity to overfit, developing theoretical frameworks that connect architecture design, optimization algorithms, and statistical learning principles.
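As a small illustration of implicit bias, the sketch below runs plain gradient descent on a toy underdetermined least-squares problem (one equation, two unknowns). Started from zero, the iterates stay in the span of the data vector and approach the minimum-norm interpolating solution. The problem data, step size, and function name are illustrative assumptions and are not taken from any of the publications listed on this page.

```python
def gradient_descent(a, b, lr=0.05, steps=2000):
    """Minimize 0.5 * (a.w - b)**2 by gradient descent, starting from w = 0."""
    w = [0.0] * len(a)
    for _ in range(steps):
        residual = sum(ai * wi for ai, wi in zip(a, w)) - b
        # The gradient of the loss with respect to w is residual * a.
        w = [wi - lr * residual * ai for wi, ai in zip(w, a)]
    return w


# One equation, two unknowns: infinitely many w satisfy a.w = b.
a, b = [1.0, 2.0], 5.0
w_gd = gradient_descent(a, b)

# Closed-form minimum-norm interpolant: w* = a * b / ||a||^2.
norm_sq = sum(ai * ai for ai in a)
w_min_norm = [ai * b / norm_sq for ai in a]

print(w_gd, w_min_norm)
```

Nothing in the loss prefers the minimum-norm solution; the preference comes from the algorithm and its initialization, which is the sense in which the bias is implicit.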

Trustworthy Machine Learning

Trustworthy Machine Learning focuses on developing reliable and accountable AI systems that can be safely deployed in critical real-world applications. Interpretability research aims to create models and methods that allow humans to understand how AI systems reach particular decisions, addressing the “black box” problem through techniques like feature attribution, concept-based explanations, and model distillation. Robustness investigations develop algorithms and frameworks that maintain performance under various challenges, including adversarial attacks (subtle input manipulations designed to fool models), distribution shifts (when deployment data differs from training data), and noisy or incomplete inputs that might occur in practical scenarios. Together, these components establish the theoretical and practical foundations needed to develop AI systems that can be trusted with high-stakes decisions in healthcare, transportation, security, and other critical domains.
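To make the notion of an adversarial attack concrete, here is a minimal sketch of a one-step sign-gradient perturbation against a linear classifier. The weights, input, and budget `eps` are hypothetical values chosen for illustration; real attacks and defenses studied in this area are considerably more involved.

```python
def sign(v):
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)


def score(w, x):
    """Linear classifier score; the predicted label is the sign of this value."""
    return sum(wi * xi for wi, xi in zip(w, x))


def sign_gradient_attack(x, y, w, eps):
    """Perturb x by at most eps per coordinate to lower the margin y * score(w, x)."""
    # The margin y * w.x has gradient y * w with respect to x, so stepping
    # against sign(y * w) lowers it as much as possible within the max-norm budget.
    return [xi - eps * sign(y * wi) for xi, wi in zip(x, w)]


w = [1.0, -2.0, 0.5]   # hypothetical trained weights
x = [0.5, -0.2, 0.1]   # clean input whose true label is y = +1
y = 1
x_adv = sign_gradient_attack(x, y, w, eps=0.3)

print(score(w, x), score(w, x_adv))
```

In this toy case, no coordinate moves by more than 0.3, yet the score changes sign and the prediction flips.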

Parsimonious Representation Learning

Parsimonious Representation Learning focuses on discovering compact, efficient ways to represent complex data while preserving essential information. Matrix factorization techniques decompose high-dimensional data matrices into lower-dimensional components, revealing latent structures and enabling applications like recommendation systems and dimensionality reduction. Subspace clustering methods identify and group data points that lie near lower-dimensional linear or affine subspaces within the ambient space, allowing for more accurate clustering of high-dimensional data with complex geometric structures. Manifold learning approaches discover nonlinear, low-dimensional structures that capture the intrinsic geometry of data, assuming that high-dimensional observations often lie on or near a lower-dimensional manifold, thus enabling more effective visualization, compression, and feature extraction while respecting the underlying data geometry.
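The simplest instance of matrix factorization is a rank-one model, where a data matrix is approximated by the outer product of two vectors. The sketch below fits such a model by alternating least squares on a tiny example. The matrix, iteration count, and function name are illustrative assumptions, and the code assumes the starting guess is not orthogonal to the data.

```python
def rank_one_factorization(M, iters=50):
    """Approximate M by the outer product u v^T using alternating least squares."""
    rows, cols = len(M), len(M[0])
    v = [1.0] * cols  # arbitrary nonzero starting guess
    for _ in range(iters):
        # With v fixed, the least-squares optimal u is M v / (v.v).
        vv = sum(vj * vj for vj in v)
        u = [sum(M[i][j] * v[j] for j in range(cols)) / vv for i in range(rows)]
        # With u fixed, the least-squares optimal v is M^T u / (u.u).
        uu = sum(ui * ui for ui in u)
        v = [sum(M[i][j] * u[i] for i in range(rows)) / uu for j in range(cols)]
    return u, v


# A 3 x 2 data matrix that is exactly rank one: outer([1, 2, 3], [4, 5]).
M = [[4.0, 5.0], [8.0, 10.0], [12.0, 15.0]]
u, v = rank_one_factorization(M)
reconstruction = [[ui * vj for vj in v] for ui in u]
max_error = max(
    abs(M[i][j] - reconstruction[i][j]) for i in range(3) for j in range(2)
)
print(max_error)
```

Six matrix entries are summarized here by five numbers. On large matrices with low-rank structure, the savings from storing factors instead of the full matrix become substantial.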

Continual Learning

Continual Learning addresses the challenge of developing machine learning systems that can acquire knowledge incrementally over time without forgetting previously learned information—a capability that comes naturally to humans but poses significant difficulties for artificial systems. This field explores strategies to overcome catastrophic forgetting, where neural networks tend to overwrite earlier knowledge when trained on new tasks, through techniques like regularization methods that identify and protect important parameters, replay mechanisms that strategically revisit past experiences, and architectural approaches that allocate specific network components to different tasks. Continual learning research spans theoretical investigations of knowledge transfer and interference, algorithmic innovations for balancing stability and plasticity, and practical applications in scenarios where models must adapt to changing environments or sequentially presented tasks, such as in robotics, personalized recommendation systems, and healthcare monitoring.

Optimization

Optimization research in machine learning develops mathematical frameworks and algorithms to efficiently find optimal parameters or solutions across diverse learning problems. Optimization on manifolds extends traditional optimization techniques to handle constraints where solutions must lie on curved mathematical spaces, enabling applications in computer vision, robotics, and scientific computing. Optimization for learning focuses on developing specialized algorithms tailored to the unique challenges of training machine learning models, addressing issues like saddle points, local minima, and the interplay between optimization dynamics and generalization performance. The intersection of optimization and dynamical systems provides theoretical tools to analyze convergence properties and training trajectories, treating optimization algorithms as discrete or continuous dynamical systems. Distributed optimization techniques enable training models across multiple machines or devices while minimizing communication costs, becoming increasingly important for large-scale learning problems and federated learning scenarios where data privacy is paramount.
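A standard textbook example of optimization on a manifold is maximizing the quadratic form x^T A x over the unit sphere, whose maximizer is a leading eigenvector of A. The sketch below takes a gradient step in the tangent space and then retracts onto the sphere by normalization. The matrix, step size, and helper names are assumptions made for illustration.

```python
import math


def matvec(A, x):
    """Multiply matrix A (list of rows) by vector x."""
    return [sum(aij * xj for aij, xj in zip(row, x)) for row in A]


def dot(x, y):
    return sum(xi * yi for xi, yi in zip(x, y))


def sphere_gradient_ascent(A, x0, lr=0.1, steps=500):
    """Maximize x^T A x over the unit sphere with tangent steps and retraction."""
    norm = math.sqrt(dot(x0, x0))
    x = [xi / norm for xi in x0]
    for _ in range(steps):
        g = [2.0 * gi for gi in matvec(A, x)]         # Euclidean gradient
        gx = dot(g, x)
        rg = [gi - gx * xi for gi, xi in zip(g, x)]   # project onto the tangent space at x
        y = [xi + lr * ri for xi, ri in zip(x, rg)]   # step along the tangent direction
        norm = math.sqrt(dot(y, y))
        x = [yi / norm for yi in y]                   # retract back onto the sphere
    return x


A = [[2.0, 1.0], [1.0, 2.0]]  # symmetric, with eigenvalues 3 and 1
x = sphere_gradient_ascent(A, [1.0, 0.0])
rayleigh = dot(x, matvec(A, x))
print(x, rayleigh)
```

The constraint is never handled with penalties or multipliers. Every iterate lies on the sphere by construction, which is the main appeal of treating the feasible set as a manifold.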

3D Vision

3D Vision research focuses on enabling computers to understand and reconstruct the three-dimensional world from visual data. Structure from Motion techniques recover both camera poses and 3D scene geometry from sequences of 2D images by identifying corresponding points across frames and solving geometric optimization problems. Motion segmentation methods separate multiple moving objects in dynamic scenes, distinguishing independent motion patterns from camera-induced apparent motion, which is crucial for autonomous navigation and video analysis. 3D scene analysis encompasses a broader set of techniques for understanding spatial relationships, object arrangements, and scene semantics in three dimensions, including depth estimation, volumetric reconstruction, and scene parsing that enables applications ranging from augmented reality and robotics to architectural modeling and autonomous driving systems.

Video

Video research in computer vision addresses the challenges of analyzing and generating temporal visual content with coherent spatial-temporal relationships. Video generation techniques create realistic or stylized video sequences using generative models that capture both appearance and motion dynamics, with applications in entertainment, simulation, and data augmentation. Action recognition methods identify human activities in video by modeling temporal patterns and motion cues, while action detection further localizes when and where specific activities occur within longer, untrimmed videos. Action segmentation extends these capabilities by precisely delineating the temporal boundaries between different activities in continuous video streams, breaking complex sequences into meaningful segments. Together, these video understanding technologies enable applications ranging from surveillance and sports analytics to human-computer interaction and automated video indexing.

Image

Image-focused computer vision research develops algorithms for understanding and manipulating still visual content across various levels of abstraction. Image generation techniques create novel visual content through generative adversarial networks, diffusion models, and other approaches that model the underlying distribution of natural or domain-specific images. Object detection methods identify and localize multiple objects within images, providing bounding boxes and class labels that enable scene understanding for applications like autonomous driving and retail analytics. Pose estimation techniques recover the spatial configuration of articulated objects, particularly human bodies or hands, enabling applications in animation, gesture recognition, and human activity analysis. Object and semantic segmentation approaches partition images into semantically meaningful regions by classifying each pixel, providing fine-grained scene decomposition that supports applications ranging from medical image analysis to computational photography and augmented reality.

Vision and Language

Vision and Language research bridges visual perception and natural language understanding to create systems that can reason about images and text in an integrated manner. Visual Question Answering develops models that can respond to natural language questions about image content, requiring multi-modal reasoning that connects visual features with linguistic concepts. Visual Grounding techniques locate objects or regions in images based on natural language descriptions, enabling applications like interactive image editing and robotic manipulation guided by verbal commands. Scene interpretation methods extract structured representations of visual scenes, identifying objects, their attributes, and their relationships to support higher-level reasoning. Image captioning systems generate natural language descriptions of visual content, requiring both visual understanding and linguistic generation capabilities to produce relevant, accurate, and contextually appropriate textual summaries of images for applications in accessibility, content indexing, and multimodal communication.

Biomedical Image Analysis

Biomedical Image Analysis employs computer vision and machine learning techniques to interpret and extract clinically relevant information from medical imaging data. Diffusion MRI analysis methods process specialized magnetic resonance signals to map tissue microstructure and neural fiber pathways in the brain, enabling studies of connectivity patterns in healthy development and neurological disorders. Explainable radiology research develops interpretable AI systems for medical image interpretation that not only provide diagnostic predictions but also justify their conclusions with visual evidence and reasoning that clinicians can verify and trust. Microscopy image analysis techniques automatically process cellular and tissue images at various scales, enabling quantification of morphological features, tracking of cellular dynamics, and identification of pathological patterns that support both clinical diagnostics and basic biological research, ultimately enhancing precision medicine through quantitative biomarkers and computational pathology.

Computer Vision for Health

Computer Vision for Health applies visual understanding technologies to healthcare challenges, creating systems that monitor, assess, and support human wellbeing. Surgical activity analysis techniques automatically recognize phases, gestures, and instrument usage during medical procedures through video analysis, enabling applications in surgical training, workflow optimization, and intraoperative decision support. Movement diagnosis systems use computer vision to quantify and characterize motor behaviors relevant to neurological and developmental conditions, providing objective assessment tools for conditions like autism spectrum disorder, where subtle movement patterns may serve as early biomarkers. Similar techniques support therapeutic monitoring in Tourette syndrome by quantifying tic frequency and severity, while rehabilitation applications track patient movements during physical therapy to provide feedback on exercise quality, measure progress over time, and personalize treatment protocols. These vision-based health systems reduce assessment subjectivity and increase accessibility of specialized healthcare expertise.

Hybrid Systems

Hybrid Systems research addresses dynamical systems that combine continuous evolution with discrete state transitions, creating mathematical frameworks for systems that switch between different operating modes. Observability studies in this domain investigate when and how the internal states of hybrid systems can be reconstructed from external measurements, which is crucial for monitoring and controlling complex systems like power grids with switching topologies or robotic systems with contact dynamics. Identification methods develop techniques to construct mathematical models of hybrid systems from experimental data, learning both the continuous dynamics within each mode and the discrete switching logic between modes. These theoretical foundations support applications in cyber-physical systems, including autonomous vehicles that switch between different control laws, smart manufacturing systems with multiple operating regimes, and biomedical devices like artificial pancreas systems that must adjust their behavior based on discrete physiological states.
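A thermostat is a classic toy hybrid system: the temperature evolves continuously, while the heater switches discretely between on and off when the temperature crosses a threshold. The sketch below simulates it with forward-Euler integration. All constants (ambient temperature, heating rate, thresholds, time step) are made up for illustration.

```python
def simulate_thermostat(t0=20.0, heater_on=False, low=19.0, high=21.0,
                        dt=0.01, steps=10000):
    """Simulate a two-mode hybrid system with forward-Euler integration."""
    temp, temps, switches = t0, [], 0
    for _ in range(steps):
        # Discrete transition: guards on the continuous state trigger a mode switch.
        if heater_on and temp >= high:
            heater_on, switches = False, switches + 1
        elif not heater_on and temp <= low:
            heater_on, switches = True, switches + 1
        # Continuous evolution: each mode has its own differential equation.
        heat_input = 3.0 if heater_on else 0.0
        temp += dt * (-0.1 * (temp - 15.0) + heat_input)
        temps.append(temp)
    return temps, switches


temps, switches = simulate_thermostat()
print(min(temps), max(temps), switches)
```

Identification runs this picture in reverse: given only a recorded temperature trace, the task is to recover both the per-mode dynamics and the switching thresholds.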

Multi-agent Systems

Multi-agent Systems research studies collections of autonomous entities that interact with each other and their environment, developing frameworks for coordination, competition, and emergent behavior. Pursuit-evasion games model strategic interactions between pursuing and evading agents, addressing questions of optimal strategies, capture conditions, and equilibrium solutions with applications in security, robotics, and computational modeling of biological systems. Consensus on manifolds extends traditional agreement protocols to scenarios where agents must coordinate on curved mathematical spaces like rotation groups or spheres, which arise naturally in applications like satellite attitude synchronization, distributed camera networks, and coordinated motion planning. This field combines ideas from game theory, control theory, and distributed computing to develop theoretical guarantees and practical algorithms for emerging technologies like drone swarms, autonomous vehicle teams, and distributed robotic systems that must cooperatively solve complex tasks.
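The unit circle is perhaps the simplest curved space on which agents can seek agreement. In the sketch below, each agent holds an angle and repeatedly moves toward its neighbors using the sine of the angle differences. The ring graph, initial angles, and step size are hypothetical, and convergence here relies on all agents starting within a half circle.

```python
import math


def circle_consensus(angles, neighbors, step=0.2, iters=500):
    """Synchronize angles on the unit circle using only neighbor differences."""
    theta = list(angles)
    for _ in range(iters):
        # Each agent moves along the circle toward its neighbors; sin() makes the
        # update respect the circle's geometry instead of treating angles as reals.
        theta = [
            theta[i] + step * sum(math.sin(theta[j] - theta[i]) for j in neighbors[i])
            for i in range(len(theta))
        ]
    return theta


# Four hypothetical agents on a ring graph, starting inside a half circle.
neighbors = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}
final = circle_consensus([0.0, 0.5, 1.0, 1.5], neighbors)
spread = max(final) - min(final)
print(final, spread)
```

Because the graph is undirected and sine is odd, the pairwise terms cancel and the sum of the angles is preserved, so the agents settle at the average of their initial angles.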

Linear Systems

Linear Systems theory provides fundamental tools for analyzing and designing systems governed by linear differential or difference equations, forming the foundation for many control and signal processing applications. Geometric approaches examine system properties through the lens of linear subspaces and transformations, revealing intrinsic structural features that inform controller design and system analysis. Sparsity considerations address scenarios where system matrices have many zero entries due to physical constraints or limited interactions between components, leading to computationally efficient algorithms and insights for large-scale systems like power networks or neural connectivity models. Observability research investigates conditions under which a system’s internal states can be reconstructed from measured outputs, including minimal sensor placement, robustness to noise, and reconstruction algorithms that enable state estimation and monitoring in applications ranging from autonomous vehicles to industrial process control and infrastructure management.
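For a linear time-invariant system with state matrix A and output matrix C, the classical Kalman rank test says the state can be reconstructed from the outputs exactly when the stacked matrix [C; CA; ...; CA^(n-1)] has rank n. The sketch below applies the test to a double integrator. The helper functions and the example system are illustrative assumptions written for this page.

```python
def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def rank(M, tol=1e-9):
    """Numerical rank via Gaussian elimination with partial pivoting."""
    M = [row[:] for row in M]
    r = 0
    for col in range(len(M[0])):
        pivot = max(range(r, len(M)), key=lambda i: abs(M[i][col]), default=None)
        if pivot is None or abs(M[pivot][col]) < tol:
            continue
        M[r], M[pivot] = M[pivot], M[r]
        for i in range(r + 1, len(M)):
            f = M[i][col] / M[r][col]
            M[i] = [a - f * b for a, b in zip(M[i], M[r])]
        r += 1
    return r


def is_observable(A, C):
    """Kalman rank test: stack C, CA, ..., CA^(n-1) and check for full column rank."""
    n = len(A)
    rows, block = [], C
    for _ in range(n):
        rows.extend(block)
        block = matmul(block, A)
    return rank(rows) == n


A = [[0.0, 1.0], [0.0, 0.0]]  # double integrator: state = (position, velocity)
position_sensor = is_observable(A, [[1.0, 0.0]])
velocity_sensor = is_observable(A, [[0.0, 1.0]])
print(position_sensor, velocity_sensor)
```

Measuring position makes the system observable, because velocity shows up in how position changes. Measuring only velocity does not, since the initial position never affects the output.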

Publication by Topics

3D Vision, AI in Medicine, Biomedical Image Analysis, Computer Vision, Deep Learning Theory, Dynamic textures, Dynamical Systems, Hybrid system identification, Hybrid Systems, Image, Linear System, Machine Learning, Multi-agent Systems, Optimization, Parsimonious Representation Learning, Trustworthy Machine Learning, Video, Vision and Language

2022

Darshan Thaker; Paris Giampouras; René Vidal

Reverse Engineering $\ell_p$ attacks: A block-sparse optimization approach with recovery guarantees Proceedings Article

In: International Conference on Machine Learning, 2022.

Tags: Optimization, Parsimonious Representation Learning

Carolina Pacheco; Gregory N. McKay; Anisha Oommen; Nicholas J. Durr; René Vidal; Benjamin D. Haeffele

Adaptive sparse reconstruction for lensless digital holography via PSF estimation and phase retrieval Journal Article

In: Optics Express, pp. 33433-33448, 2022.

Tags: Parsimonious Representation Learning

2021

Mustafa Kaba; Mengnan Zhao; René Vidal; Daniel P. Robinson; Enrique Mallada

What Is the Largest Sparsity Pattern That Can Be Recovered by 1-Norm Minimization? Journal Article

In: IEEE Transactions on Information Theory, vol. 67, no. 5, pp. 3060-3074, 2021.

Tags: Optimization, Parsimonious Representation Learning

2020

H. Lobel; R. Vidal; A. Soto

CompactNets: Compact Hierarchical Compositional Networks for Visual Recognition Journal Article

In: Computer Vision and Image Understanding, vol. 191, 2020.

Tags: Computer Vision, Image, Parsimonious Representation Learning

Xiao Li; Zhihui Zhu; Anthony Man-Cho So; René Vidal

Nonconvex robust low-rank matrix recovery Journal Article

In: SIAM Journal on Optimization, vol. 30, no. 1, pp. 660-686, 2020.

Tags: Linear System, Optimization, Parsimonious Representation Learning, Trustworthy Machine Learning

2019

Benjamin D Haeffele; René Vidal

Structured low-rank matrix factorization: Global optimality, algorithms, and applications Journal Article

In: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 6, pp. 1468-1482, 2019.

Tags: Deep Learning Theory, Linear System, Parsimonious Representation Learning

2018

E. Schwab; R. Vidal; N. Charon

Joint Spatial-Angular Sparse Coding for Diffusion MRI with Separable Dictionaries Journal Article

In: Medical Image Analysis, vol. 48, pp. 25-42, 2018.

Tags: AI in Medicine, Biomedical Image Analysis, Image, Parsimonious Representation Learning

M. C. Tsakiris; R. Vidal

Theoretical Analysis of Sparse Subspace Clustering with Missing Entries Journal Article

In: arXiv:1801.00393, 2018.

Tags: Linear System, Machine Learning, Parsimonious Representation Learning

Chun-Guang Li; Chong You; René Vidal

On Geometric Analysis of Affine Sparse Subspace Clustering Journal Article

In: IEEE Journal of Selected Topics in Signal Processing, vol. 12, no. 6, pp. 1520–1533, 2018.

Tags: Linear System, Machine Learning, Parsimonious Representation Learning

Jacopo Cavazza; Benjamin D Haeffele; Connor Lane; Pietro Morerio; Vittorio Murino; René Vidal

Dropout as a Low-Rank Regularizer for Matrix Factorization Proceedings Article

In: International Conference on Artificial Intelligence and Statistics, pp. 435–444, 2018.

Tags: Linear System, Parsimonious Representation Learning

2017

Benjamin D. Haeffele; René Vidal

Structured Low-Rank Matrix Factorization: Global Optimality, Algorithms, and Applications Journal Article

In: arXiv:1708.07850, 2017.

Tags: Deep Learning Theory, Linear System, Parsimonious Representation Learning

B. D. Haeffele; R. Stahl; G. Vanmeerbeeck; R. Vidal

Efficient Reconstruction of Holographic Lens-Free Images by Sparse Phase Recovery Proceedings Article

In: Medical Image Computing and Computer Assisted Intervention, pp. 109–117, Springer 2017.

Tags: Parsimonious Representation Learning

Chun-Guang Li; Chong You; René Vidal

Structured Sparse Subspace Clustering: A Joint Affinity Learning and Subspace Clustering Framework Journal Article

In: IEEE Transactions on Image Processing, vol. 26, no. 6, pp. 2988-3001, 2017.

Tags: Image, Linear System, Machine Learning, Parsimonious Representation Learning

E. Schwab; R. Vidal; N. Charon

($k, q$)-Compressed Sensing for dMRI with Joint Spatial-Angular Sparsity Prior Proceedings Article

In: MICCAI Workshop on Computational Diffusion MRI, 2017.

Tags: Parsimonious Representation Learning

E. Schwab; R. Vidal; N. Charon

Joint Spatial-Angular Sparse Coding for Diffusion MRI with Separable Dictionaries Journal Article

In: arXiv, 2017.

Tags: Biomedical Image Analysis, Parsimonious Representation Learning

Jacopo Cavazza; Connor Lane; Benjamin D Haeffele; Vittorio Murino; René Vidal

An Analysis of Dropout for Matrix Factorization Journal Article

In: arXiv preprint arXiv:1710.03487, 2017.

Tags: Linear System, Parsimonious Representation Learning

Benjamin D Haeffele; Sophie Roth; Lin Zhou; René Vidal

Removal of the Twin Image Artifact in Holographic Lens-Free Imaging by Sparse Dictionary Learning and Coding Proceedings Article

In: IEEE International Symposium on Biomedical Imaging, pp. 741–744, 2017.

Tags: Image, Machine Learning, Parsimonious Representation Learning

Florence Yellin; Benjamin D Haeffele; René Vidal

Blood Cell Detection and Counting in Holographic Lens-free Imaging by Convolutional Sparse Dictionary Learning and Coding Proceedings Article

In: IEEE International Symposium on Biomedical Imaging, pp. 650–653, 2017.

Tags: AI in Medicine, Machine Learning, Parsimonious Representation Learning

2016

R. Vidal; B. Haeffele

System and method for reconstruction of holographic lens-free images by multi-depth sparse phase recovery Miscellaneous

WIPO Patent Application WO2018085655A1, 2016.

Tags: 3D Vision, Parsimonious Representation Learning

R. Vidal; B. Haeffele

System and Method for Removal of Twin Image Artifact in Holographic Lens-free Imaging by Sparse Dictionary Learning Miscellaneous

European Patent Application EP3318932A1, 2016.

Tags: Image, Machine Learning, Parsimonious Representation Learning

C. -G. Li; R. Vidal

A Structured Sparse plus Structured Low-Rank Framework for Subspace Clustering and Completion Journal Article

In: IEEE Transactions on Signal Processing, vol. 64, no. 24, pp. 6557-6570, 2016.

Tags: Linear System, Machine Learning, Parsimonious Representation Learning

E. Schwab; R. Vidal; N. Charon

Spatial-Angular Sparse Coding for HARDI Proceedings Article

In: Medical Image Computing and Computer Assisted Intervention, pp. 475–483, Springer 2016.

Tags: Biomedical Image Analysis, Parsimonious Representation Learning

Chong You; Daniel P. Robinson; René Vidal

Scalable Sparse Subspace Clustering by Orthogonal Matching Pursuit Proceedings Article

In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3918-3927, 2016.

Tags: Linear System, Machine Learning, Optimization, Parsimonious Representation Learning

2015

C. -G. Li; R. Vidal

Structured Sparse Subspace Clustering: A Unified Optimization Framework Proceedings Article

In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 277-286, 2015.

Tags: Linear System, Machine Learning, Optimization, Parsimonious Representation Learning

C. -G. Li; C. You; R. Vidal

On Sufficient Conditions for Affine Sparse Subspace Clustering Proceedings Article

In: Signal Processing with Adaptive Sparse Structured Representations, 2015.

Tags: Parsimonious Representation Learning

H. Lobel; R. Vidal; A. Soto

Learning Shared, Discriminative, and Compact Representations for Visual Recognition Journal Article

In: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, no. 11, pp. 2218-2231, 2015.

Tags: Computer Vision, Machine Learning, Parsimonious Representation Learning

V. M. Patel; H. V. Nguyen; R. Vidal

Latent Space Sparse and Low-rank Subspace Clustering Journal Article

In: IEEE Journal of Selected Topics in Signal Processing, vol. 9, no. 4, pp. 691-701, 2015.

Tags: Linear System, Machine Learning, Parsimonious Representation Learning

R. Vidal; B. Haeffele; E. Young

System and method for structured low-rank matrix factorization: optimality, algorithm, and applications to image processing Miscellaneous

US Patent US20160371563A1, 2015.

Tags: Image, Linear System, Parsimonious Representation Learning

C. You; R. Vidal

Subspace-Sparse Representation Journal Article

In: arXiv:1507.01307, 2015.

Tags: Linear System, Parsimonious Representation Learning

Chong You; René Vidal

Geometric Conditions for Subspace-Sparse Recovery Proceedings Article

In: International Conference on Machine Learning, pp. 1585–1593, 2015.

Tags: Linear System, Parsimonious Representation Learning

C. You; R. Vidal

Geometric Conditions for Subspace-Sparse Recovery Proceedings Article

In: Signal Processing with Adaptive Sparse Structured Representations, 2015.

Tags: Parsimonious Representation Learning

C. You; D. Robinson; R. Vidal

Scalable Sparse Subspace Clustering by Orthogonal Matching Pursuit Journal Article

In: arXiv:1507.01238, 2015.

Tags: Linear System, Machine Learning, Optimization, Parsimonious Representation Learning

Benjamin D Haeffele; René Vidal

Global Optimality in Tensor Factorization, Deep Learning, and Beyond Journal Article

In: arXiv preprint arXiv:1506.07540, 2015.

Tags: Deep Learning Theory, Linear System, Machine Learning, Parsimonious Representation Learning

2014

H. E. Cetingül; M. Wright; P. Thompson; R. Vidal

Segmentation of High Angular Resolution Diffusion MRI using Sparse Riemannian Manifold Clustering Journal Article

In: IEEE Transactions on Medical Imaging, vol. 33, no. 2, pp. 301-317, 2014.

Tags: AI in Medicine, Biomedical Image Analysis, Machine Learning, Parsimonious Representation Learning

V. M. Patel; R. Vidal

Kernel Sparse Subspace Clustering Proceedings Article

In: IEEE International Conference on Image Processing, pp. 2849-2853, 2014.

Tags: Linear System, Machine Learning, Parsimonious Representation Learning

L. Tao; F. Porikli; R. Vidal

Sparse Dictionaries for Semantic Segmentation Proceedings Article

In: European Conference on Computer Vision, 2014.

Tags: Computer Vision, Parsimonious Representation Learning

Benjamin D Haeffele; Eric Young; René Vidal

Structured Low-Rank Matrix Factorization: Optimality, Algorithm, and Applications to Image Processing Proceedings Article

In: International Conference on Machine Learning, pp. 2007–2015, 2014.

Tags: Image, Linear System, Parsimonious Representation Learning

2013

Ehsan Elhamifar; René Vidal

Sparse Subspace Clustering: Algorithm, Theory, and Applications Journal Article

In: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, no. 11, pp. 2765-2781, 2013.

Tags: Linear System, Machine Learning, Parsimonious Representation Learning

V. M. Patel; H. V. Nguyen; R. Vidal

Latent Space Sparse Subspace Clustering Proceedings Article

In: IEEE International Conference on Computer Vision, pp. 225-232, 2013.

Tags: Linear System, Machine Learning, Parsimonious Representation Learning

2012

E. Elhamifar; R. Vidal

Block-Sparse Recovery via Convex Optimization Journal Article

In: IEEE Transactions on Signal Processing, vol. 60, no. 8, pp. 4094-4107, 2012.

Tags: Optimization, Parsimonious Representation Learning

2011

H. E. Cetingül; R. Vidal

Sparse Riemannian Manifold Clustering for HARDI Segmentation Proceedings Article

In: IEEE International Symposium on Biomedical Imaging, pp. 839–842, 2011.

Tags: Biomedical Image Analysis, Machine Learning, Parsimonious Representation Learning

2007

T. Li; V. Kallem; D. Singaraju; R. Vidal

Projective Factorization of Multiple Rigid-Body Motions Proceedings Article

In: IEEE Conference on Computer Vision and Pattern Recognition, 2007.

Tags: Linear System, Parsimonious Representation Learning

2002

R. Vidal; S. Soatto; S. Sastry

A Factorization Method for 3D Multi-body motion estimation and segmentation Technical Report

UC Berkeley no. UCB/ERL M02/3, 2002.

Tags: 3D Vision, Linear System, Parsimonious Representation Learning, Video

Vidal Lab
