Rahul Shukla

My Work

Data Science

Automating the Economic Census
Using machine learning to automate industry classification for the Economic Census
I built a hierarchical machine learning model (scikit-learn) to generate industry classification codes in preparation for the 2022 U.S. Economic Census. Our estimates show that this work could save the U.S. Census Bureau up to $1.5 million. We presented the project to audiences as large as 300 people, including a former U.S. CTO, the Administrator of the United States Digital Service, and the Deputy Director of the U.S. Census Bureau.
Data Science internship at the U.S. Census Bureau in Washington, D.C. Part of the Civic Digital Fellowship's 2019 cohort.
Skills
Python scikit-learn NLTK Pandas NumPy
Links
Slides Medium
Contagion in School Shootings
Analyzing the school shooter copycat syndrome using computational methods
I built a data collection protocol that includes a document classification pipeline, a classification schema to categorize news articles, and a web-scraping tool for LexisNexis. Hoping to publish by the end of the year.
Undergraduate Research Assistant under Professor Adam Pah. Part of the Amaral Lab. Awarded the 2020 Weinberg Summer Grant.
Skills
Python scikit-learn NLTK Selenium Time-Series Analysis Regression Analysis Stochastic Modeling
Links
Coming Soon

Product

Northwestern Open Data Initiative
Building Northwestern University's first open data portal for any Northwestern-related datasets
I led a team of seven other developers through fifteen agile sprints, focusing on strategic vision, technical execution, product design, dataset acquisition, and partnerships. Some of my contributions include: defining and presenting a short-term and long-term product vision and strategy to multiple stakeholders, including members of the Northwestern faculty and administration; spearheading strategic partnerships with the winning Associated Student Government campaign, the Stanford Open Data Project, and Northwestern's Institutional Research Office, VP of Analytics, and Data Governance Group; and creating wireframes, defining and prioritizing key features, and setting a timeline with our development team. We are looking to launch this fall!
Founder and President of the Northwestern Open Data Initiative
Links
Website Coming Soon Press
Evanston Zoning Visualization Tool
Visualizing zoning laws and accessory dwelling units (ADUs) for the Evanston Development Cooperative
I led a team of three developers through ten agile sprints, focusing on design, technical execution, and user testing. Some of my contributions include: working with the CEO of EDC to define product requirements, compartmentalizing the problem, and creating product wireframes.
Product Manager and Team Lead at the Develop + Innovate for Social Change club at Northwestern.
Links
GitHub

Policy

Open Data for Universities
Creating an open data handbook for students and universities
Coming Soon: Fall 2021
Links
Coming Soon
Analyzing Educational Technology
Researching the Matthew Effect in India
I wrote a report on the Matthew Effect and its impact on educational technology in India, focusing on Khan Academy's recent partnership with Tata.
Research internship at The World Bank
Links
Report