Projects

My research is focused on developing deep latent variable models (DLVMs), such as the VAE, with provable properties of the distribution in the latent space that find application in outlier detection and disentanglement analysis. I have developed a method to find the relevant latent variables in DLVMs that are sufficient to model a data distribution, representing the intrinsic dimensions of the dataset. In addition, I am interested in training deep neural networks with limited annotated data, and I have proposed methods to interpret biomedical and seismic images using Gaussian processes in the few-shot setup. Besides probabilistic modeling, I have worked on registering 3D scans (RGB-D data).

As a research assistant, I have collaborated with researchers from ExxonMobil, USA, on multiple projects related to interpreting seismic images, such as few-shot segmentation, multitask learning, and explainable AI. I also completed a data science-computer vision internship at Ancestry, where I gained valuable experience working with large-scale image data and developing innovative solutions.



ARD-VAE: A Statistical Formulation to Find the Relevant Latent Dimensions of Variational Autoencoders

Surojit Saha, Sarang Joshi, and Ross Whitaker (2024). (under review)
PDF Code



AdaSemSeg: An Adaptive Few-shot Semantic Segmentation of Seismic Facies

Surojit Saha, and Ross Whitaker (2024) (under review)
PDF Code



Matching Aggregate Posteriors in the Variational Autoencoder

Surojit Saha, Sarang Joshi, and Ross Whitaker, " Matching Aggregate Posteriors in the Variational Autoencoder ", International Conference on Pattern Recognition (Early Accept), 2024.
PDF Code



Multi-task Training as Regularization Strategy for Seismic Image Segmentation

Surojit Saha*, Wasim Gazi*, Rehman Mohammed, Thomas Rapstine, Hayden Powers, and Ross Whitaker, " Multi-task Training as Regularization Strategy for Seismic Image Segmentation", IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023.
*=Equal Contribution
PDF



GENs: Generative Encoding Networks

Surojit Saha, Shireen Elhabian, Ross Whitaker, " GENs: Generative Encoding Networks ", Machine Learning, 2022.
PDF



GP-UNet: Few-Shot Segmentation of Microscopy Images Using Gaussian Process

Surojit Saha, Ouk Choi, Ross Whitaker, " Few-Shot Segmentation of Microscopy Images Using Gaussian Process ", MOVI, MICCAI workshop, 2022.
PDF

You can find my publications on my Google Scholar profile.