When you think of the word “superhero” what do you imagine? Language reflects and reinforces social norms; ungendering language is a vital part of interrogating sexism. However, there’s no dataset of gendered words. This talk is about data – where to get it and how to create it if it doesn’t exist. In this talk, LinkedIn software engineer Omayeli Arenyeka creates the dataset for The Gendered Project, showing how to view unavailable data as an opportunity rather than an obstacle to answering questions.