Research
My research is in the area of computer vision, with applications to
environmental science, medical imaging, robotics, and virtual reality.
I am particularly interested in data-driven and geometric techniques
to more robustly understand images taken "in the wild". This research
exploits the fact that cameras are incredibly precise measurement
systems --- if they are calibrated properly, then the vast quantities
of visual data they collect can help us learn about, understand, and
manipulate the world around us. At a high level, the current themes
of research in my lab are:
- Understanding visual change at scales from the sidewalk to the planet:
What can you do with a billion images? Webcams, iPhones, and flocks
of micro-satellites provide a visual depiction of the Earth with
unprecedented temporal and spatial coverage. Our aim is to organize
these disparate image sources to create coherent global imaging
systems that answer important questions facing our society and our
planet. Specifically, we work to understand the physical principles
that govern image formation in realistic environments, and we use
those in application domains that include understanding patterns of
tree growth at a continental scale, characterizing the use of public
spaces over time, and creating generalizable models of how image
appearance varies over time (a small sketch of this last idea appears
after this list).
- Next-generation imaging systems:
Virtual reality and robotics applications require fast visual
reasoning systems to characterize the 3D world and the pose of
objects within it. The objective of this research effort is to
understand and overcome the fundamental limits of cameras and
materials that make that task difficult. Specifically, our work has
developed the theory of motion estimation from multi-perspective and
compressive sensing cameras, fundamental constraints for calibrating
cameras with other sensing systems, and the design of new light-field
modulating materials that make geometric inference especially easy (a
baseline motion-estimation sketch appears after this list).
- Democratizing visual analytics and applications to social justice:
What can our community do to make visual reasoning available to more
people? Our lab builds web implementations and smartphone apps that
support the broader public's ability to create and use visual
inference across many applications. These include apps that support
citizen science through repeat photography, and the geocalibration.org
website, which allows precise geo-location of an image with respect
to Google Maps and was used to find the lost grave of a Jane Doe
crime victim from 30 years ago (a sketch of the underlying idea
appears after this list).
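
To make the appearance-over-time idea concrete, here is a minimal
Python sketch: treat each frame from a single static webcam as a point
in pixel space and extract the dominant temporal appearance modes with
PCA (via the SVD). The frames/ directory, grayscale conversion, and
two-mode choice are placeholder assumptions for illustration, not our
actual pipeline.

    # Minimal sketch: dominant appearance modes of one static webcam via PCA.
    # Assumes a hypothetical directory of same-size, registered frames
    # (e.g., one image per hour from a single AMOS-style camera).
    import glob
    import numpy as np
    from PIL import Image

    paths = sorted(glob.glob("frames/*.jpg"))   # hypothetical frame directory
    X = np.stack([np.asarray(Image.open(p).convert("L"), dtype=np.float64).ravel()
                  for p in paths])              # (num_frames, num_pixels)
    X -= X.mean(axis=0)                         # center each pixel over time

    # Thin SVD: rows of Vt are "eigenimages"; U*S gives per-frame coefficients
    # whose trajectories over time capture diurnal and seasonal change.
    U, S, Vt = np.linalg.svd(X, full_matrices=False)
    coeffs = U[:, :2] * S[:2]                   # each frame as a 2-D point
    print("per-frame coefficient trajectory:", coeffs.shape)
    print("top-2 modes explain %.1f%% of pixel variance"
          % (100 * (S[:2] ** 2).sum() / (S ** 2).sum()))

For a long-running outdoor camera, the top coefficients typically
track day/night and seasonal lighting cycles, one simple model of how
image appearance varies over time.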
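As background for the motion-estimation theme, the sketch below shows
the classical single-pinhole baseline that multi-perspective and
compressive-camera work generalizes: recovering relative camera pose
from matched points via the essential matrix. It uses OpenCV on
synthetic correspondences; the intrinsics, scene points, and true
motion are invented for illustration.

    # Two-view ego-motion for a standard pinhole camera (baseline case).
    import cv2
    import numpy as np

    K = np.array([[800.0, 0.0, 320.0],       # invented intrinsics
                  [0.0, 800.0, 240.0],
                  [0.0,   0.0,   1.0]])

    rng = np.random.default_rng(0)
    pts3d = rng.uniform([-1, -1, 4], [1, 1, 8], size=(50, 3))  # synthetic scene
    R_true = cv2.Rodrigues(np.array([0.0, 0.1, 0.0]))[0]       # small yaw
    t_true = np.array([1.0, 0.0, 0.0])                         # sideways motion

    def project(P, R, t):
        """Project world points into pixels for a camera with pose (R, t)."""
        Xc = P @ R.T + t
        uv = Xc[:, :2] / Xc[:, 2:]
        return uv @ K[:2, :2].T + K[:2, 2]

    pts1 = project(pts3d, np.eye(3), np.zeros(3))
    pts2 = project(pts3d, R_true, t_true)

    E, _ = cv2.findEssentialMat(pts1, pts2, K, method=cv2.RANSAC)
    E = E[:3]                              # guard: may return stacked candidates
    _, R, t, _ = cv2.recoverPose(E, pts1, pts2, K)
    print("recovered rotation:\n", R)      # translation is known only up to scale

Note that two-view pinhole geometry determines the translation only up
to scale, one of the ambiguities that motivates richer camera designs.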
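And to illustrate the geocalibration idea: if a few ground-plane
points can be matched between a photograph and an overhead map, a
homography maps any other ground pixel to map coordinates. This is
only a sketch; the correspondences and coordinates below are invented
placeholders, not the actual geocalibration.org implementation.

    # Fit a pixel-to-map homography from hand-clicked ground correspondences.
    import cv2
    import numpy as np

    # (pixel_x, pixel_y) -> (easting, northing) for four ground points; all
    # values below are made up for illustration.
    px = np.array([[100, 400], [520, 410], [310, 250], [600, 230]],
                  dtype=np.float64)
    geo = np.array([[4350.0, 1200.0], [4410.0, 1195.0],
                    [4380.0, 1260.0], [4435.0, 1270.0]])

    H, _ = cv2.findHomography(px, geo)         # exact fit for 4 planar points

    query = np.array([[[350.0, 300.0]]])       # any other ground-plane pixel
    print(cv2.perspectiveTransform(query, H))  # its estimated map coordinates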
Datasets, Apps, and Frequently Requested Code
- The Archive of Many Outdoor Scenes (AMOS).
- TraffickCam, an app to crowd-source the creation of a continually
updated index of the appearance of hotel rooms to support
investigations of sex trafficking.
- Citizen Science Repeat Photography App and Data (rePhoto).
- Code (12 KB zip file) for
MATLAB and web visualizations of image manifolds, from the paper: "A
Survey of Manifold Learning for Images", Robert Pless and Richard
Souvenir, IPSJ Transactions on Computer Vision and Applications,
vol. 1, pp. 83-94, 2009 (an illustrative manifold-embedding sketch
appears below this list).
(bib)
- Code (2.3 MB zip file) from our
paper: "Extrinsic Calibration of a Camera and Laser Range Finder",
Qilong Zhang and Robert Pless, IROS 2004 (a sketch of the core
geometric constraint appears below this list).
(bib)
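
The manifold-learning package above is MATLAB; purely as an
illustration of the survey's topic, here is a short Python sketch that
embeds a set of images as points in a low-dimensional space with
Isomap from scikit-learn. The random images are placeholders for a
real image collection.

    # Embed images as points on a low-dimensional manifold (illustrative only).
    import numpy as np
    from sklearn.manifold import Isomap

    rng = np.random.default_rng(0)
    images = rng.random((200, 32 * 32))       # stand-in for 200 32x32 images
    emb = Isomap(n_neighbors=8, n_components=2).fit_transform(images)
    print(emb.shape)                          # (200, 2): one 2-D point per image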
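Finally, a sketch of the geometric constraint at the heart of the
camera-laser calibration code above: a checkerboard plane estimated in
the camera frame (unit normal n, offset d) must contain each laser
point p after the unknown rigid transform (R, t), i.e.
n . (R p + t) = d. Stacking many such equations gives a linear system
in the twelve entries of [R | t]. The synthetic data and the SVD
projection back onto rotations below are simplifications; a full
method would refine this estimate nonlinearly.

    # Linear estimate of camera-laser extrinsics from plane constraints.
    import numpy as np

    rng = np.random.default_rng(1)
    R_true = np.linalg.qr(rng.standard_normal((3, 3)))[0]   # random rotation
    if np.linalg.det(R_true) < 0:
        R_true[:, 0] *= -1.0
    t_true = np.array([0.2, -0.1, 0.05])

    A, b = [], []
    for _ in range(6):                       # six checkerboard poses
        n = rng.standard_normal(3)
        n /= np.linalg.norm(n)               # plane normal in the camera frame
        d = rng.uniform(1.0, 3.0)            # plane offset: n . x = d
        for _ in range(10):                  # ten laser hits per plane
            x = rng.standard_normal(3)
            x += (d - n @ x) * n             # project x onto the plane n . x = d
            p = R_true.T @ (x - t_true)      # the same point in the laser frame
            A.append(np.concatenate([np.kron(p, n), n]))  # row for [vec(R); t]
            b.append(d)

    sol = np.linalg.lstsq(np.array(A), np.array(b), rcond=None)[0]
    R = sol[:9].reshape(3, 3, order="F")     # un-vectorize (column stacking)
    U, _, Vt = np.linalg.svd(R)
    R = U @ Vt                               # project onto the nearest rotation
    print("rotation recovered:", np.allclose(R, R_true, atol=1e-6))
    print("translation estimate:", sol[9:])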