Shashank Sule

Collective variable discovery

posted on 5 Jul 2025

Last month marked the culmination of a two major projects regarding collective variable discovery, a fundamental interdisciplinary problem in drug discovery, computational statistical physics, and stochastic processes. From a probabilistic perpsective, this problem asks: how can we map a stochastic process to low dimensions and still preserve its statistics? In two case study-style papers on the butane molecule and Lennard-Jones clusters we provide some answers by resorting to quantitive coarse graining theory and proposing algorithms that use some of my favourite tools from geometric data science.

Two new papers

posted on 12 Feb 2025

Two papers I’ve been working on are out. One tells a story about how you can combine classical and deep learning methods for magnetic resonance imaging in the brain. The other one shows how to use the Neumann eigenvectors of subgraphs for dimension reduction–with nearly isometric embeddings!

New conference paper!

posted on 28 Aug 2024

Our paper on Short Time Fourier Transform (STFT) phase retrieval was recently published at the 34th European Signal Processing Conference! We showed that neural networks can perform speech phase retreival given far fewer samples than what is mathematically required. Read it here.

Mathematical Research Community on AI

posted on 13 Jul 2024

I participated in the super exciting Mathematical Research Community (MRC) on Explainable, Interpretable, and Adversarial AI at Beaver Hollow, NY! A lot of things were discussed–including the capabilities of transformers, latent geometries of language models, and neural collapse. You can hear more on these matters at the upcoming Joint Math Meetings in Seattle at the special session for this MRC.

New preprint

posted on 25 Dec 2023

‘Tis the season to drop preprints! My work titled Sharp error estimates for target measure diffusion maps with applications to the committor problem with Dr. Maria Cameron and Luke Evans is submitted and up on arXiv. This paper is about leveraging the approximation theory of diffusion maps to gain concrete speedups in accuracy for applications in molecular dynamics.

Diffusion models workshop!

posted on 8 Dec 2023

I attended the Measure Transport, Diffusion Processes and Sampling Workshop at the Flatiron Institute! My poster featured new results on Target Measure Diffusion Maps (TMDmap), including a stability estimate for Dirichlet BVPs using TMDmaps and an application to machine learning-based rare event quantification in the butane molecule.

Optimal transport in Data Science!

posted on 12 May 2023

I recently presented a poster titled Error analysis of Target Measure Diffusion Maps and applications to transition path theory at the Optimal Transport for Data Science workshop at ICERM, Brown University!

Fall 2023 RIT: Machine Learning For Rare Events

posted on 13 Sep 2022

Next meeting: December 4th, 3:00 pm, Kirwan Hall 1310. Speaker: Meenakshi Krishnan

Summer Update!

posted on 12 Sep 2022

I spent last week attending the Applied Harmonic Analysis and Machine Learning Summer School at the Machine Learning Genoa Center in Genova, Italy!

Summer updates

posted on 24 Aug 2021

I worked on creating and implementing a new algorithm for using information theory to make phylogenetic trees from aligned DNA sequences at the Cummings Lab in UMD.

This project motivated many interesting biological questions, but here is a mathematical one that caught my fancy: Given a function \(f: \{0,1\}^n \to \mathbb{R}\), can one find a polynomial time algorithm to compute the global maximum of \(f\)? One might be tempted to say “ah this is just integer programming” but what happens when there is no meaningful extension of \(f\) to \([0,1]^n\)? This problem arises in computing the optimal split in a tree, where we must maximize the function \(\vert A \vert H(A) + \vert A^c \vert H(A^c)\) over all possible partitions \(A \sqcup A^c = X\) of a finite set \(X\). Here \(H(A)\) is the (empirical) entropy of \(A\). Email me if you have suggestions and find out more about this project here.

New Projects!

posted on 4 Jun 2021

The initial commit of projects is in (hopefully with many more to come). Particular highlights include my undergraduate thesis, code for a couple of projects, and a stash of notes meant for the cramming undergraduate (aka me 2 years ago). Check them out here.

Website is up!

posted on 4 Jun 2021

Finally the website is up! It looks a bit minimal, but that’s kind of the style I meant to go for. I used this template designed by Marc Weitz to start with. The wonders of open source never cease to amaze.