Computer Vision
Separating the" Chirp" from the" Chat": Self-supervised Visual Grounding of Sound and Language
IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024
Wonderjourney: Going from anywhere to everywhere
IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024
FeatUp: A Model-Agnostic Framework for Features at Any Resolution
International Conference on Learning Representations (ICLR) 2024
Diffusion with forward models: Solving stochastic inverse problems without direct supervision
Advances in Neural Information Processing Systems 2023
Evaluating Peripheral Vision as an Input Transformation to Understand Object Detection Model Behavior
NeuRIPS 2023 Workshop on Gaze Meets ML
StatTexNet: Evaluating the Importance of Statistical Parameters for Pyramid-Based Texture and Peripheral Vision Models
NeuRIPS 2023 Workshop on Gaze Meets ML
COCO-Periph: Bridging the Gap Between Human and Machine Perception in the Periphery
International Conference on Learning Representations 2024
Evaluating Pyramid-Based Image Statistics Using Contrastive Learning
Journal of Vision
Metaclue: Towards comprehensive visual metaphors research
IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023
Can Shadows Reveal Biometric Information?
Winter Conference on Applications of Computer Vision (WACV) 2023
Associating objects and their effects in video through coordination games
Advances in Neural Information Processing Systems 2022
Structure and motion from casual videos
European Conference on Computer Vision (ECCV) 2022
Disentangling architecture and training for optical flow
European Conference on Computer Vision (ECCV) 2022
Maskgit: Masked generative image transformer
Conference on Computer Vision and Pattern Recognition (CVPR) 2022
Unsupervised semantic segmentation by distilling feature correspondences
International Conference on Learning Representations (ICLR) 2022
Nerfactor: Neural factorization of shape and reflectance under an unknown illumination
ACM Transactions on Graphics (TOG), 2021
Light field networks: Neural scene representations with single-evaluation rendering
Advances in Neural Information Processing Systems (NeurIPS) 2021
Explaining in style: Training a gan to explain a classifier in stylespace
International Conference on Computer Vision (ICCV) 2021
Thundr: Transformer-based 3d human reconstruction with markers
International Conference on Computer Vision (ICCV) 2021
What you can learn by staring at a blank wall
International Conference on Computer Vision (ICCV) 2021
MosAIc: Finding Artistic Connections across Culture with Conditional Image Retrieval
NeurIPS 2020 Competition and Demonstration Track
Autoflow: Learning a better training set for optical flow
Conference on Computer Vision and Pattern Recognition (CVPR) 2021
Lasr: Learning articulated shape reconstruction from a monocular video
Conference on Computer Vision and Pattern Recognition (CVPR) 2021
Neural Descent for Visual 3D Human Pose and Shape
Conference on Computer Vision and Pattern Recognition (CVPR) 2021
Omnimatte: associating objects and their effects in video
Conference on Computer Vision and Pattern Recognition (CVPR) 2021
Multi-plane program induction with 3d box priors
Advances in Neural Information Processing Systems (NeurIPS) 2020
Weakly Supervised 3D Human Pose and Shape Reconstruction with Normalizing Flows
European Conference on Computer Vision (ECCV), 2020
GHUM & GHUML: Generative 3D Human Shape and Articulated Pose Models
Conference on Computer Vision and Pattern Recognition (CVPR), 2020
SpeedNet: Learning the Speediness in Videos
Conference on Computer Vision and Pattern Recognition (CVPR), 2020
Perspective Plane Program Induction From a Single Image
Computer Vision and Pattern Recognition(CVPR), 2020
Semantic Pyramid for Image Generation
Computer Vision and Pattern Recognition (CVPR), 2020
MannequinChallenge: Learning the Depths of Moving People by Watching Frozen People
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020
Visual Deprojection: Probabilistic Recovery of Collapsed Dimensions
International Conference on Computer Vision (ICCV) 2019
Boundless: Generative adversarial networks for image extension
IEEE International Conference on Computer Vision(ICCV), 2019
Learning shape templates with structured implicit functions
IEEE International Conference on Computer Vision(ICCV), 2019
Program-Guided Image Manipulators
IEEE International Conference on Computer Vision (ICCV), 2019
Using unknown occluders to recover hidden scenes
IEEE Conference on Computer Vision and Pattern Recognition(CVPR), 2019
Reasoning about physical interactions with object-centric models
International Conference on Learning Representations (ICLR), 2019
Deep Audio Priors Emerge From Harmonic Convolutional Networks
International Conference on Learning Representations (ICLR), 2019
Learning the Depths of Moving People by Watching Frozen People
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Best Paper Honorable Mention.
Speech2Face: Learning the Face Behind a Voice
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Using Unknown Occluders to Recover Hidden Scenes
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Visual Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks
IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2019
Learning to Infer and Execute 3D Shape Programs
International Conference on Learning Representations (ICLR), 2019
Reasoning About Physical Interactions with Object-Centric Models
International Conference on Learning Representations (ICLR), 2019
GAN Dissection: Visualizing and Understanding Generative Adversarial Networks
International Conference on Learning Representations (ICLR), 2019
Unsupervised Discovery of Parts, Structure, and Dynamics
International Conference on Learning Representations (ICLR), 2019
Learning to Describe Scenes with Programs
International Conference on Learning Representations (ICLR), 2019
Video Enhancement with Task-Oriented Flow
International Journal of Computer Vision (IJCV), 2019
Learning to Reconstruct Shapes from Unseen Classes
Neural Information Processing Systems (NeurIPS), 2018. Oral presentation.
3D-Aware Scene Manipulation via Inverse Graphics
Neural Information Processing Systems (NeurIPS), 2018
Learning to Exploit Stability for 3D Scene Parsing
Neural Information Processing Systems (NeurIPS), 2018
Visual Object Networks: Image Generation with Disentangled 3D Representations
Neural Information Processing Systems (NeurIPS), 2018
ShadowCam: Real-Time Detection of Moving Obstacles Behind A Corner For Autonomous Vehicles
International Conference on Intelligent Transportation Systems (ITSC), 2018
MoSculp: Interactive Visualization of Shape and Time
ACM Symposium on User Interface Software and Technology (UIST), 2018
3D Shape Perception from Monocular Vision, Touch, and Shape Priors
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2018
Learning Shape Priors for Single-View 3D Completion and Reconstruction
European Conference on Computer Vision (ECCV), 2018
Physical Primitive Decomposition
European Conference on Computer Vision (ECCV), 2018
Seeing Tree Structure from Vibration
European Conference on Computer Vision (ECCV), 2018
Best-buddies similarity—robust template matching using mutual nearest neighbors
IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2018
Unsupervised Training for 3D Morphable Model Regression
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
Inferring Light Fields from Shadows
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018. Spotlight presentation.
Learning and Using the Arrow of Time
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
Smart, Sparse Contours to Represent and Edit Images
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
Exploiting Occlusion in Non-Line-of-Sight Active Imaging
IEEE Transactions on Computational Imaging, 2018
Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation
SIGGRAPH, 2018
Cognitive Load Estimation in the Wild
CHI Conference on Human Factors in Computing Systems, 2018
3D Interpreter Networks for Viewer-Centered Wireframe Modeling
International Journal of Computer Vision (IJCV), 2018
Learning Sight from Sound: Ambient Sound Provides Supervision for Visual Learning
Interactional Journal of Computer Vision (IJCV), 2018
Learning to See Physics via Visual De-animation
Neural Information Processing Systems (NIPS), 2017. Spotlight presentation.
Shape and Material from Sound
Neural Information Processing Systems (NIPS), 2017. Spotlight presentation.
Marrnet: 3D shape reconstruction via 2.5D sketches
Neural Information Processing Systems (NIPS), 2017
Turning Corners into Cameras: Principles and Methods
International Conference on Computer Vision (ICCV), 2017
Generative modeling of audible shapes for object perception
International Conference on Computer Vision (ICCV), 2017
Motion microscopy for visualizing and quantifying small motions
Proceedings of the National Academy of Sciences (PNAS), 2017
Guest Editorial Special Issue on Extreme Imaging
IEEE Transactions on Computational Imaging 3 (3), 382-383
Synthesizing normalized faces from facial identity features
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017
On the Effectiveness of Visible Watermarks
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017
Eulerian Video Magnification and Analysis
Communications of the ACM, Vol. 60 No. 1, Pages 87-95, January 2017
Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling
Neural Information Processing Systems (NIPS), 2016
Visual Dynamics: Probabilistic Future Frame Synthesis via Cross Convolutional Networks
Neural Information Processing Systems (NIPS), 2016. Oral presentation.
Video Camera–Based Vibration Measurement for Civil Infrastructure Applications
Journal of Infrastructure Systems, Vol 3 (2)
Ambient Sound Provides Supervision for Visual Learning
European Conference on Computer Vision (ECCV), 2016. Oral presentation.
Single Image 3D Interpreter Network
European Conference on Computer Vision (ECCV), 2016. Oral presentation.
Physics 101: Learning Physical Object Properties from Unlabeled Videos
British Machine Vision Conference (BMVC), 2016.
A Comparative Evaluation of Approximate Probabilistic Simulation and Deep Neural Networks as Accounts of Human Physical Scene Understanding
Annual Meeting of the Cognitive Science Society (CogSci), 2016. Oral presentation.
Visually Indicated Sounds
Computer Vision and Pattern Recognition (CVPR) 2016
Computational Imaging for VLBI Image Reconstruction
Computer Vision and Pattern Recognition (CVPR) 2016
Galileo: Perceiving Physical Object Properties by Integrating a Physics Engine with Deep Learning
Neural Information Processing Systems, 127-135 (NIPS), 2015
A computational approach for obstruction-free photography
ACM Transactions on Graphics (TOG) 34 (4), (SIGGRAPH), 2015
Modal identification of simple structures with high-speed video using motion magnification
Journal of Sound and Vibration vol 345, pages 58-71, 2015
Best-Buddies Similarity for Robust Template Matching
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
The Aperture Problem for Refractive Motion
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
Video Magnification in Presence of Large Motions
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
Visual Vibrometry: Estimating Material Properties from Small Motions in Video
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
Developments with Motion Magnification for Structural Modal Identification Through Camera Video
Dynamics of Civil Structures, Volume 2, pages 49-57, 2015
Refraction Wiggles for Measuring Fluid Depth and Velocity from Video
European Conference on Computer Vision (ECCV), 2014
The Visual Microphone: Passive Recovery of Sound from Video
ACM Transactions on Graphics, Volume 33, Number 4 (Proc. SIGGRAPH), 2014.
Camouflaging an Object from Many Viewpoints
IEEE Computer Vision and Pattern Recognition (CVPR), 2014
Seeing the Arrow of Time
IEEE Computer Vision and Pattern Recognition (CVPR), 2014
A Compositional Model for Low-Dimensional Image Set Representation
IEEE Computer Vision and Pattern Recognition (CVPR), 2014
Accidental Pinhole and Pinspeck Cameras
International Journal of Computer Vision
110 (2), 92-112
Riesz Pyramids for Fast Phase-Based Video Magnification
International Conference on Computational Photography (ICCP), 2014.
Structural modal identification through high speed camera video
Topics in Modal Analysis I, Volume 7, pages 191-197, Springer International Publishing, 2014.
Estimating the Material Properties of Fabric from Video
2013 IEEE International Conference on Computer Vision (ICCV)
Group Norm for Learning Structured SVMs with Unstructured Latent Variables
2013 IEEE International Conference on Computer Vision (ICCV)
Shape Anchors for Data-Driven Multi-view Reconstruction
International Conference on Computer Vision (ICCV), 2013
Towards Longer Long-Range Motion Trajectories
British Machine Vision Conference (BMVC) 2012
Annotation Propagation in Large Image Databases via Dense Image Correspondence
European Conference on Computer Vision (ECCV), October 2012
Patch Complexity, Finite Pixel Correlations and Optimal Denoising
European Conference on Computer Vision (ECCV), October 2012
Shapecollage: Occlusion-Aware, Example-Based Shape Interpretation
European Conference on Computer Vision (ECCV), October 2012
Accidental pinhole and pinspeck cameras: revealing the scene outside the picture
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2012
Laser Speckle Photography for Surface Tampering Detection
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2012
Diffuse Reflectance Imaging with Astronomical Applications
IEEE Intl. Conf. on Computer Vision (ICCV), 2011
Evaluation of Image Features Using a Photorealistic Virtual World
IEEE Intl. Conf. on Computer Vision (ICCV), 2011
Blur Kernel Estimation Using the Radon Transform
Proc. 23rd IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2011
Efficient Marginal Likelihood Optimization in Blind Deconvolution
IEEE Conf. on Computer Vision and Pattern Recognition, June 2011
Motion Denoising with Application to Time-lapse Photography
Proc. 23rd IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2011
Where computer vision needs help from computer science
ACM-SIAM Symposium on Discrete Algorithms, January, 2011
Infinite Images: Creating and Exploring a Large Photorealistic Virtual Space
Proceedings of the IEEE, volume 98, issue 8, pages 1391 - 1407, 2010
Matching and Predicting Street Level Images
Workshop for Vision on Cognitive Tasks, European Conf. on Computer Vision (ECCV) 2010
Motion blur removal with orthogonal parabolic exposures
IEEE Intl. Conf. on Computational Photography (ICCP), 2010
The Patch Transform
IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), vol. 32, issue 8, pages 1489 - 1501, August, 2010
A Content-Aware Image Prior
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2010
A Probabilistic Image Jigsaw Puzzle Solver
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2010
Analyzing Spatially-varying Blur
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2010
Latent Hierarchical Structural Learning for Object Detection
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2010
Part and Appearance Sharing: Recursive Compositional Models for Multi-View Multi-Object Detection
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2010
Using the Forest to See the Trees: Exploiting Context for Visual Object Detection and Localization
Communications of the ACM, March 2010, Vol. 53, No. 3
Ground-truth dataset and baseline evaluations for intrinsic image algorithms
International Conference on Computer Vision, 2009
Nonparametric Bayesian Texture Learning and Synthesis
Neural Information Processing Systems (NIPS) 2009
Segmenting Scenes by Matching Image Composites
Neural Information Processing Systems (NIPS) 2009
Informative Sensing of Natural Images
IEEE Int. Conf. Image Processing, Egypt, Nov. 2009
Understanding and evaluating blind deconvolution algorithms
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2009
Best paper award runner up
LabelMe: a Database and Web-based Tool for Image Annotation
International Journal of Computer Vision, 77(1-3):157-173, 2008
SIFT Flow: Dense Correspondence across Different Scenes
European Conference on Computer Vision, ECCV 2008
Understanding camera trade-offs through a Bayesian analysis of light field projections
European Conference on Computer Vision, ECCV 2008
80 million tiny images: a large dataset for non-parametric object and scene recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence., Volume 30 , Issue 11 (November 2008), Pages: 1958-1970
Creating and exploring a large photorealistic virtual space
First IEEE Workshop on Internet Vision, associated with CVPR 2008
Human-assisted motion annotation
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008
The patch transform and its applications to image editing
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008
Best Poster Award, CVPR 2008
Unsupervised Discovery of Visual Object Class Hierarchies
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008
Describing visual scenes using transformed objects and parts
International Journal of Computer Vision, 77, May 2008
Signal and Image Processing with Belief Propagation
DSP Application Column, IEEE Signal Processing Magazine, Mar. 2008
Automatic estimation and removal of noise from a single image
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Vol 30, No. 2, pp. 299-314, Feb., 2008
A reliable skin mole localization scheme
2007 IEEE Workshop on Mathematical Methods in Biomedical Image Analysis (MMBIA), in conjunction with 2007 ICCV
Image and depth from a conventional camera with a coded aperture
ACM Trans. On Graphics (Proc. SIGGRAPH) 2007
Learning Compressed Sensing
45th Allerton Conference on Communication, Control, and Computing, 2007
Object Recognition by Scene Alignment
Advances in Neural Information Processing Systems (NIPS), 2007
Face Hallucination: theory and practice
International Journal of Computer Vision, Vol. 75, no. 1, pp. 115-134, October, 2007
Learning Gaussian Conditional Random Fields for Low-Level Vision
IEEE Computer Vision and Pattern Recognition (CVPR) 2007
What makes a good model of natural images?
IEEE Computer Vision and Pattern Recognition (CVPR) 2007
Sharing visual features for multiclass and multiview object detection
IEEE Transactions on Pattern Analysis and Machine Intelligence , vol. 29, no. 5, pp. 854-869, May, 2007
Exploring defocus matting: non-parametric acceleration, super-resolution, and off-center matting
IEEE Computer Graphics and Applications, special issue on Computational Photography, March, 2007
Analysis of contour motions
Advances in Neural Information Processing Systems (NIPS 2006)
Received Outstanding Student Paper Award
Bayesian model of human color constancy
Journal of Vision, 6, 1267-1281, doi:10.1167/6.11.10. 2006
Depth from familiar objects: a hierarchical model for 3d scenes
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) New York, NY, June, 2006
Estimating Intrinsic Component Images using Non-Linear Regression
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) New York, NY, June, 2006
Noise estimation from a single image
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) New York, NY, June, 2006
Using multiple segmentations to discover objects and their extent in image collections
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) New York, NY, June, 2006
Object detection and localization using local and global features
Lecture Notes in Computer Science (unrefeered). Sicily workshop on object recognition, 2005
Shared features for multiclass object detection
Towards Category-Level Object Recognition. Springer Lecture Notes in Computer Science (invited submission). 2005
Describing Visual Scenes using Transformed Dirichlet
Neural Information Processing Systems (NIPS), Vancouver, B.C., Dec. 2005
An Ensemble Prior of Image Structure for Cross-modal Inference
International Conference on Computer Vision (ICCV), Beijing, China, vol. 1, pp. 871-876, Oct. 2005
Discovering Objects and their Location in Images
International Conference on Computer Vision (ICCV), Beijing, China, Oct. 2005
Received 2017 Helmholtz prize, test-of-time award.
Learning Hierarchical Models of Scenes, Objects, and Parts
International Conference on Computer Vision (ICCV), Beijing, China, Oct. 2005
LabelMe: a database and web-based tool for image annotation
MIT AI Lab Memo AIM-2005-025, September, 2005
Recovering Intrinsic Images from a Single Image
IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 27, Issue 9, September 2005, Pages 1459 – 1472
Distributed Occlusion Reasoning for Tracking with Nonparametric Belief Propagation
Neural Information Processing Systems (NIPS) 2004
Using the forest to see the trees: a graphical model relating features, objects, and scenes
Advances in Neural Information Processing Systems 16 (NIPS), Vancouver, BC, MIT Press, 2004
Contextual Models for Object Detection Using Boosted Random Fields
Neural Information Processing Systems (NIPS), Vancouver, B.C., Dec. 2004
Single-frame Text Super-resolution: A Bayesian Approach
International Conference on Image Processing (ICIP), Oct. 2004
Efficient graphical models for processing images
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) Washington, DC, 2004
Sharing visual features for multiclass and multiview object detection
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) Washington, DC, 2004; MIT CSAIL technical report
Visual Hand Tracking Using Nonparametric Belief Propagation
Workshop on Generative Model Based Vision, CVPR, June 2004
Comparison of graph cuts with belief propagation for stereo, using identical MRF parameters
IEEE Intl. Conference on Computer Vision (ICCV), Nice, France, October, 2003
Context-based vision system for place and object recognition
IEEE Intl. Conference on Computer Vision (ICCV), Nice, France, October, 2003
Exploiting spatial and spectral image regularities for color constancy
3rd Intl. Workshop on Statistical and Computational Theories of Vision (associated with Intl. Conf. on Computer Vision), Nice, France, October, 2003
Exploiting the sparse derivative prior for super-resolution and image demosaicing
3rd Intl. Workshop on Statistical and Computational Theories of Vision (associated with Intl. Conf. on Computer Vision), Nice, France, October, 2003
Nonparametric Belief Propagation and Facial Appearance Estimation
IEEE Computer Vision and Pattern Recognition (CVPR), Madison, WI, June, 2003
Properties and Applications of Shape Recipes
IEEE Computer Vision and Pattern Recognition (CVPR), Madison, WI, June, 2003
Shape-Time Photography
IEEE Computer Vision and Pattern Recognition (CVPR), Madison, WI, June, 2003
Learning style translation for the lines of a drawing
ACM Transactions on Graphics, January, 2003
Shape Recipes: Scene Representations that Refer to the Image
Neural Information Processing Systems (NIPS) 2002
Example-based super-resolution
IEEE Computer Graphics and Applications, March/April, 2002.
Test-of-time award given in 2023 from IEEE CG&A.
Learning Joint Statistical Models for Audio-Visual Fusion and Segregation
Advances in Neural Information Processing Systems 13, edited by T. K. Leen, T. G. dietterich, and V. Tresp, pp. 772-778, 2001
Learning local evidence for shading and reflectance
International Conference on Computer Vision, Vancouver, BC, Canada, 2001
Learning Motion Analysis
Statistical Theories of the Brain, edited by R. Rao, B. Olshausen, and M. Lewicki, MIT Press, 2001
Bayesian Reconstruction of 3D Human Motion from Single-Camera Video
Advances in Neural Information Processing Systems 12, edited by S. A. Solla, T. K. Leen, and K-R Muller, 2000
Learning Low-Level Vision
International Journal of Computer Vision, 40(1), pp. 25-47, 2000
Separating style and content with bilinear models
Neural Computation 12(6), pp. 1247-1283, 2000
Markov networks for super-resolution
Proceedings of 34th Annual Conference on Information Sciences and Systems (CISS 2000), Dept. Electrical Engineering, Princeton University, Princeton, NJ 08544-5263, March, 2000
Learning low-level vision
Appeared in IEEE International Conference on Computer Vision, Corfu, Greece, 1999
Artificial retina chips as on-chip image processors and gesture-oriented interfaces
Optical Engineering, Vol. 38, No. 12, December, 1999
Computer vision for computer interaction
SIGGRAPH Computer Graphics magazine, November, 1999
An Inexpensive, All Solid-state Video and Data Recorder for Accident Reconstruction
Presented at the 1999 SAE International Congress and Exposition in Detroit, Michigan on March 3, 1999; published as SAE Technical Paper number 1999-10-1299
Markov networks for low-level vision
Presented at Workshop on Statistical and Computational Theories of Vision
Learning to estimate scenes from images
Neural Information Processing Systems, volume 11, 1999
A factorization approach to grouping
Proceedings, European Conference on Computer Vision, 1998
Bayesian model of surface perception
Neural Information Processing Systems, volume 10, pp. 787-793, 1998
Bayesian Estimation of 3-D Human Motion
Tech. Rep. TR98-06, Mitsubishi Electric Research Laboratories, Cambridge, MA, July 1998
Computer vision for interactive computer graphics
IEEE Computer Graphics and Applications, volume 18, number 3, May-June, pp. 42-53, 1998
Separating Style and Content
Neural Information Processing Systems 9, M. C. Mozer, M. I. Jordan and T. Petsche, Eds., Morgan Kaufmann, San Mateo, CA., 1997
Bayesian Color Constancy
Journal of the Optical Society of America, A, 14(7), pp. 1393-1411, July, 1997
Learning bilinear models for two-factor problems in vision
IEEE Conference on Computer Vision and Pattern Recognition (CVPR '97), Puerto Rico, U. S. A., June, 1997
Received Outstanding Paper prize, CVPR '97
Exploiting the generic viewpoint assumption
International Journal Computer Vision, 20 (3), 243-261, 1996
The generic viewpoint assumption in a Bayesian framework
Perception as Bayesian Inference, D. Knill and W. Richards, eds., Cambridge University Press, 365 - 390, 1996
Computer vision for computer games
, 2nd International Conference on Automatic Face and Gesture Recognition, Killington, VT, USA, pp. 100-105
Example-based head tracking
2nd International Conference on Automatic Face and Gesture Recognition, Killington, VT, USA.
A gesture controlled human interface using an artificial retina chip
IEEE Lasers and Electro-Optics (LEOS '96), July, 1996
Artificial retina chips as image input interfaces for multimedia systems
Optoelectronics and Communications Conference, OECC'96, Chiba, Japan, July, 1996
The steerable pyramid: a flexible architecture for multi-scale derivative computation
2nd Annual IEEE International Conference on Image Processing, Washington, DC. October, 1995
Bayesian decision theory, the maximum local mass estimate, and color constancy
Fifth International Conference on Computer Vision, IEEE Computer Society, Cambridge, MA, U.S.A, June, 1995, pp. 210 - 217
Orientation histograms for hand gesture recognition
International Workshop on Automatic Face- and Gesture- Recognition, IEEE Computer Society, Zurich, Switzerland, June, 1995, pp. 296-301
Winner, 2013 Test-of-time award from Face and Gesture Recognition conference. Here is a video prepared to accept the test-of-time award, describing the work in its context, in .mov format, or in .mpeg format.
Television control by hand gestures
International Workshop on Automatic Face- and Gesture- Recognition, IEEE Computer Society, Zurich, Switzerland, June, 1995, pp. 179-183
Bayesian method for recovering surface and illuminant properties from photosensor responses
Human Vision, Visual Processing and Digital Display V, SPIE Proceedings Series, vol. 2179, 1994
Computer vision for computer graphics
SIGGRAPH '94 and '95 course notes
Demonstration of an interactive environment for collaboration and learning
IEEE Computer, Vol. 27, No. 12, Dec. 1994
The generic viewpoint assumption in a framework for visual perception
Nature, vol. 368, p. 542 - 545, April 7, 1994
Exploiting the generic view assumption to estimate scene parameters
IEEE International Conference on Computer Vision, Berlin, Germany, 1993
Building and using catalogs of grey-level junctions
Proc. 15th European Conference on Visual Perception, Edinburgh, Scotland. August, 1993
Steerable Filters and Local Analysis of Image Structure
Ph.D. Thesis, Massachusetts Institute of Technology, 1992
Shiftable Multi-Scale Transforms
IEEE Trans. Information Theory, Special Issue on Wavelets. Vol. 38, No. 2, pp. 587-607, March 1992
The design and use of steerable filters
IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 13, no. 9, pp. 891 - 906, September, 1991
Motion without movement
ACM Computer Graphics, vol. 25, no. 4, (SIGGRAPH '91), pp. 27 - 30, July, 1991
A neural network for image noise removal
1st National Conference on Neural Networks and their Applications, Beijing, 1990
(in Chinese)
Pyramids and multiscale representations
Proc. 13th European Conference on Visual Perception, Paris, 1990
Steerable filters for early vision, image analysis, and wavelet decomposition
IEEE International Conference on Computer Vision, Osaka, Japan, 1990
Helmholtz Prize--test-of-time award winner.
Applications of neural networks in image processing
Automation Soc. of China Symp. on Neural Networks, pp. 46 - 55, Beijing, 1989
(in Chinese)
Steerable filters
OSA Topical Meeting on Image Understanding and Machine Vision, Technical Digest Series Volume 14, June, 1989
Image processing to remove grain from photographs
Society of Photographic Scientists and Engineers 42nd Annual Conference, pp. 457 - 460, May, 1989
Computer Image Processing of STEM Images of Tobacco Mosaic Virus
Ultramicroscopy 6, 367-76 (1981)