Hello everyone! I am Disha Shrivastava, a Senior Research Scientist at Google DeepMind, London. I am a core-contributor to the Gemini project focused on boosting its coding and reasoning abilities through post-training recipes, data, and evals. In addition, I am actively developing general-purpose agents designed to achieve superhuman performance in critical environments for humanity. Notably, I played a key role in developing Jules, an asynchronous coding agent. My interests also extend to designing better human-agent interfaces.
I did my PhD in AI at Mila, working with Hugo Larochelle and Danny Tarlow . My thesis focused on developing novel methods to identify and leverage contextual cues to improve deep learning models of code. During my PhD, I also worked as a Student Researcher at Google Brain, as a Research Scientist Intern at DeepMind working on AlphaCode, and as a Visiting Researcher at ServiceNow Research.
Before my PhD, I was a Research Software Engineer at IBM Research, India where I worked on unsupervised knowledge graph construction, computational creativity metrics, and reasoning for maths question-answering. I hold a Masters in Computer Technology from IIT Delhi, where I developed a data and model-parallel framework for training of deep networks in Apache Spark. My Bachelors degree is in Electronics and Communication Engineering from BIT Mesra.
I co-organized workshops on Deep Learning for Code (ICLR 2022-23), AIPLANS (NeurIPS 2021) and Neurosymbolic Generative Models (ICLR 2023). Outside of work, I like travelling, cooking, reading books, singing and blogging!