Anton Osokin
Isomorphic Labs
Research Scientist
preferred contact:
Bio | Publications | Teaching
I'm an ML researcher most excited about working with complicated objects with a lot of structure. Currently, I work with molecules at Isomorphic Labs. In the past, I used to work with code, text and images as an associate professor at HSE University, Moscow, Russia and researcher at Yandex lab/Yandex Research. I did both my undergrad and PhD studies in computer science at Lomonosov Moscow State University with Bayes Group and a postdoc in ML/CV at INRIA/École Normale Supérieure in Paris.
Selected publications
SPARQLing Database Queries from Intermediate Question Decompositions
In proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
[paper and supplementary]
[bibtex]
[code and data]
@inproceedings{saparina21sparqling, title = {{SPARQLing} Database Queries from Intermediate Question Decompositions}, author = {Irina Saparina and Anton Osokin}, booktitle = {proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP)}, year = {2021} }
OS2D: One-Stage One-Shot Object Detection by Matching Anchor Features
In proceedings of the European Conference on Computer Vision (ECCV), 2020
[paper and supplementary]
[bibtex]
[code and data]
[teaser video (1 min)]
[video (10 min)]
@inproceedings{osokin20os2d, title = {{OS2D}: One-Stage One-Shot Object Detection by Matching Anchor Features}, author = {Anton Osokin and Denis Sumin and Vasily Lomakin}, booktitle = {proceedings of the European Conference on Computer Vision (ECCV)}, year = {2020} }
Quantifying Learning Guarantees for Convex but Inconsistent Surrogates
Advances in Neural Information Processing Systems (NeurIPS), 2018
[paper and supplementary]
[bibtex]
[teaser video (3 min)]
[seminar talk (90 min)]
@inproceedings{struminsky18consistency, title = {Quantifying Learning Guarantees for Convex but Inconsistent Surrogates}, author = {Kirill Struminsky and Simon Lacoste-Julien and Anton Osokin}, booktitle = {Advances in Neural Information Processing Systems (NIPS)}, year = {2018} }
Marginal Weighted Maximum Log-likelihood for Efficient Learning of Perturb-and-Map Models
In proceedings of the international conference on Uncertainty in Artificial Intelligence (UAI), 2018
SEARNN: Training RNNs with Global-Local Losses
In proceedings of the International Conference on Learning Representations (ICLR), 2018
[paper and supplementary]
[bibtex]
[webpage/data]
[code]
@inproceedings{searnn2018leblond, author = {Leblond, R\'emi and Alayrac, Jean-Baptiste and Osokin, Anton and Lacoste-Julien, Simon}, title = {\textsc{SeaRnn}: Training RNNs with Global-Local Losses}, booktitle = {ICLR}, year = {2018} }
On Structured Prediction Theory with Calibrated Convex Surrogate Losses
Advances in Neural Information Processing Systems (NIPS) (oral presentation), 2017
[paper and supplementary]
[bibtex]
[code]
[poster]
[teaser video (3 min)]
[NIPS talk (15 min)]
@inproceedings{osokin17consistency, title = {On Structured Prediction Theory with Calibrated Convex Surrogate Losses}, author = {Anton Osokin and Francis Bach and Simon Lacoste-Julien}, booktitle = {Advances in Neural Information Processing Systems (NIPS)}, year = {2017} }
GANs for Biological Image Synthesis
In proceedings of the International Conference on Computer Vision (ICCV), 2017
[paper and supplementary]
[bibtex]
[code/data]
[poster]
[ICCV spotlight (3 min)]
@InProceedings{osokin2017biogans, author = {Anton Osokin and Anatole Chessel and Rafael E. Carazo Salas and Federico Vaggi}, title = {{GANs} for Biological Image Synthesis}, booktitle = {proceedings of the International Conference on Computer Vision (ICCV)}, year = {2017} }
Minding the Gaps for Block Frank-Wolfe Optimization of Structured SVMs
In proceedings of the International Conference on Machine Learning (ICML), 2016
[paper and supplementary]
[ICML talk (15 min)]
[bibtex]
[webpage]
[code]
@InProceedings{osokin16gapBCFW, author = {Anton Osokin and Jean-Baptiste Alayrac and Isabella Lukasewitz and Puneet K. Dokania and Simon Lacoste-Julien}, title = {Minding the Gaps for Block {F}rank-{W}olfe Optimization of Structured {SVM}s}, booktitle = {proceedings of the International Conference of Machine Learning (ICML)}, year = {2016} }
Breaking Sticks and Ambiguities with Adaptive Skip-gram
In proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS), 2016
[paper]
[supplementary]
[bibtex]
[code]
@inproceedings{bartunov16adagram, title = {Breaking Sticks and Ambiguities with Adaptive {S}kip-gram}, author = {Sergey Bartunov and Dmitry Kondrashkin and Anton Osokin and Dmitry Vetrov}, booktitle = {proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS)}, year = {2016} }
Context-aware CNNs for person head detection
In proceedings of the International Conference on Computer Vision (ICCV), 2015
Tensorizing Neural Networks
Advances in Neural Information Processing Systems (NIPS), 2015
Submodular relaxation for inference in Markov random fields
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 37(7): 1347-1359, 2015
[paper]
[bibtex]
[code]
@article{osokin15smr, Title = {Submodular relaxation for inference in {M}arkov random fields}, Author = {Anton Osokin and Dmitry Vetrov}, Journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)}, Year = {2015}, Number = {7}, Pages = {1347--1359}, Volume = {37} }
Perceptually Inspired Layout-aware Losses for Image Segmentation
In proceedings of the European Conference on Computer Vision (ECCV), 2014
Putting MRFs on a Tensor Train
In proceedings of the International Conference on Machine Learning (ICML), 2014
[paper]
[supplementary]
[bibtex]
[code]
@inproceedings{novikov14tensorTrainMRF, title = {Putting {MRFs} on a Tensor Train}, author = {Alexander Novikov and Anton Rodomanov and Anton Osokin and Dmitry Vetrov}, booktitle = {proceedings of the International Conference on Machine Learning (ICML)}, year = {2014} }
A Principled Deep Random Field Model for Image Segmentation
In proceedings of the Computer Vision and Pattern Recognition (CVPR), 2013
[paper]
[supplementary]
[bibtex]
[code]
@inproceedings{kohli13segmentation, title = {A Principled Deep Random Field Model for Image Segmentation}, author = {Pushmeet Kohli and Anton Osokin and Stefanie Jegelka}, booktitle = {proceedings of the Computer Vision and Pattern Recognition (CVPR)}, year = {2013} }
Minimizing Sparse High-Order Energies by Submodular Vertex-Cover
Advances in Neural Information Processing Systems (NIPS), 2012
Fast Approximate Energy Minimization with Label Costs
International Journal of Computer Vision (IJCV), 96(1):1-27, 2012
[paper]
[bibtex]
[code]
@article{delong12labelCosts, Title = {Fast Approximate Energy Minimization with Label Costs}, Author = {Andrew Delong and Anton Osokin and Hossam Isack and Yuri Boykov}, Journal = {International Journal of Computer Vision (IJCV)}, Year = {2012}, Number = {1}, Pages = {1--27}, Volume = {96} }
Submodular Decomposition Framework for Inference in Associative Markov Networks with Global Constraints
In proceedings of the Computer Vision and Pattern Recognition (CVPR), 2011
[paper]
[techreport]
[bibtex]
[code]
@inproceedings{osokin11smd, title = {Submodular Decomposition Framework for Inference in Associative {M}arkov Networks with Global Constraints}, author = {Anton Osokin and Dmitry Vetrov and Vladimir Kolmogorov}, booktitle = {proceedings of the Computer Vision and Pattern Recognition (CVPR)}, year = {2011} }
Fast Approximate Energy Minimization with Label Costs
In proceedings of the Computer Vision and Pattern Recognition (CVPR), 2010
PhD thesis
Submodular relaxation for energy minimization in Markov random fields
Lomonosov Moscow State University. 2014. In Russian
[text (pdf) (in Russian)]
[synopsis (pdf) (in Russian)]
[code]
see TPAMI 2015 paper for the English version
see TPAMI 2015 paper for the English version
Teaching
Deep learning at CS HSE, Moscow, Russia. Lecturer. 2018, 2019-spring, 2019-fall, 2020, 2021
All the materials (in Russian) are available online (lecture slides, seminars, recorded lectures)
All the materials (in Russian) are available online (lecture slides, seminars, recorded lectures)
Introduction to discrete optimization at CentraleSupélec, Châtenay-Malabry, France. Co-lecturer with Karteek Alahari. 2016
Graphical models at CMC MSU. Seminars and practical sessions. Lecturers: Dmitry Vetrov, Dmitry Kropotov.
2012, 2013, 2014
2012, 2013, 2014
Graphical models at Yandex Data Analysis School. Seminars and practical sessions. Lecturers: Victor Lempitsky, Dmitry Vetrov.
2011, 2012, 2013
2011, 2012, 2013
Machine Learning at CMC MSU. Practical sessions.
2012, 2013, 2014
2012, 2013, 2014
Scientific seminar on Bayesian methods of machine learning at CMC MSU. Co-organizer together with Dmitry Vetrov and Dmitry Kropotov.
2010-2014
2010-2014
GitHub
Twitter