Feature Selection for Mixed Models

With the guidance of Dr. Aleksander Aravkin.

skmixed

For my PhD thesis, I develop feature selection methods that work well on ill-conditioned data. Such data arises when the features are correlated or when they contain missing values and outliers. To stabilize model training on such data I use optimization methods that have implicit regularization properties. For instance, SR3 methods implicitly smooth the level-sets of the loss-functions via partial minimization. I used it to create a stable feature-selection method for linear mixed-effects models. Check out my implementation of it here.

COVID-Project

Together with IHME Math Sciences Team and 130+ colleagues across IHME. memoir

Together with IHME Math Sciences Team, I worked on the math behind IHME SEIIR COVID Model, which has been in use staring May, 5th, 2020. We made one of the first COVID-19 models that attracted attention of decision makers across the world and provided guidance for the distribution of ICUs during the first surge of the pandemic.

MEMOIR

Jointly with Yury Maximov, Massih-Resa Amini, and Anton Belyy.

memoir

In this project we use novel similarity search techniques to bring multi-class SVM to the realm of extra-large datasets (billions or objects, hundreds of thousands of classes). In particular, we integrated maximal inner product search search techniques like SimpleLSH and SmallWorldGraph into the Pegasos SVM. It boosted the convergence speed and gave us a flexible memory-precision trade-off mechanism. See the preprint for more details.

Bandits with Knapsacks for Demand Control in Power Grids

Jointly with Yury Maximov, Michael Chertkov, and Michail Kretchetov

tcl

In this work we used reinforcement learning to control electricity demand of an ensemble of thermostatic appliances. Mathematically, we modelled the loads with Markov chains and then trained a multi-armed bandit to control them by sending centralized policies. The main challenge was that the bandit had a limited budget and he was charged by the loads for every consumption curtailment act. We adapted a linear contextual bandit with a knapsack to this Markov chains setting, and it showed a better performance comparing to existing solutions on synthetic experiments. More details are in my bachelor thesis.

RySearch

Jointly with Anton Belyy, Mariia Selezniova, and Konstantin Vorontsov.

rysearch

We used a hierarchical topic modeling technique called BigARTM to create an exploratory search engine for large heterogeneous corpora of Russian scientific texts. We created a prototype of such an engine, and we also created several new metrics of quality for hierarchical topic models. You can play with the prototype here!