This is the collection of all the final projects for my introductory statistics college classes. Student names are linked to their Github accounts. Project titles are linked to the associated website. Note that some data/sites are not publicly available (as indicated with a “*’’) because of limitations placed on the project by the organization which supplied the data. Projects must be solo, except for a 2 or 3 exceptions each semester.
Readers might also be interested in the final projects created in my free data science course for high school students as well as in the final projects for my second semester statistics course.
Nosa Lawani: Towards Understanding Gun Homicide in the 50 States. The project compares correlates of gun violence across the 50 states.
Hana Kim: How NYC Students of Different Racial Groups Perform Academically Over Time. My project explores how students of different racial groups in NYC perform on benchmark exams over time.
Shai-Li Ron: Covid-19 Vaccination Rates for Cities of Different Socio-econmic Status. This project investigates how socio-economic status impacts vaccination of cities in Israel, a country which has been leading with the highest vaccination rates.
Mohit Mandal: The Future of Cricket: the Evolution of Twenty20. An investigation of how T20 cricket has evolved since its inception in the early 2000s.
Trevor Cobb: Healthcare Costs and Barriers to Care in the United States. An exploration of the relationship between healthcare costs and barriers to care in the United States.
Alice Chen: Tech and Money: Global Access and Ownership of Personal Technology. This project explores the relationship and change in personal tech device ownership and internet access of people globally.
Fahad Alkhaja: Managing Expectations: xSoccer Data. I look at and analyze different soccer stats with a few sample cases, specifically xG, xPts and models I generated.
Alex Tsotadze: The Effects of COVID-19 on Domestic Disputes in US Cities. I analyze how new COVID-19 cases affect domestic dispute calls in certain US Cities, specifically Baltimore, Cincinniati, Los Angeles, Orlando and Seattle.
Caroline Behrens: Behind Bars: How Prison Statisitcs Vary Between Northern and Southern States. My project explores the differences between prisoners in northern and southern states.
Dennis Blyashov: Kittbio Labs: An Exploration into Customer Demographics. The applications leverages the power of google search trends to explore specific keywords and build customer demographic models.
Scott Bek: COVID-19 Death Rate and Air Pollution Level in China. My project explores whether air polution level is related to COVID-19 death rate in China.
Christopher Snopek: Hitter Evaluations Regarding Barrels. This project investigates which MLB players are undervalued and overvalued based on the percentage of hits they have that are hit hard.
Shaked Leibovitz: Happiness, Freedom and Gender Inequality: Exploration Accross Countries. My project explores levels of gender inequality in different countries all over the world and the relation to happiness and perceived freedom to make life choices.
Nana Koranteng: Coups and Stability: How the Middle East Compares to the Rest of the World. My project explores how Middle Eastern countries compare to the rest of the world in terms of coup related events.
Sarah Brashear: Student Achievement and Socioeconomic Status in U.S. Public Schools. I analyzed the relationship between student achievement and socio-economic status in U.S. public schools.
James Fitz-Henley: Distributing Opportunities: Economic and Demographic Predictors of Opportunity Zone Designation. How do a census tract’s economic and demographic conditions predict its designation as an Opportunity Zone in the American southeast?
Katherine McPhie: Social Connectedness in the Harvard Class of 2024. Analyzing how Harvard first-years have been forming social connections during the COVID-19 pandemic. This project is joint work with Elliott Detjen, Giovanni Salcedo, and Ava Swanson.
Ryan Zhang: Strategic and Sincere Considerations in American Presidental Primaries. Exploring how often, and the extent to which, primary voters balance sincere concerns (candidate favorability and ideology) and strategic concerns (candidate viability/electability) in American presidential contests.
Ben Lee: A Minnesota Fisherman’s Guide to the Spring 2021 Season. This project gives recommendations for the best Northern Minnesota lakes for walleye fishing for the 2021 season.
Julia Blank: International Judging Biases in Ice Dance: Analyzing the 2018 Olympic Season. An investigation into whether a judge’s nationality bias is reflected in the scores of Olympic level Ice Dancers.
Akila Muthukumar: English Requirements in US Clinical Trials. An analysis of English and Spanish language requirements in eligibility criteria for neurological and mental health related clinical trials.
Alexa Jordan: Women in the Courts: Parties and Presidents. This project looks at the progress we have made since Sandra Day O’Connor in 1981 and investigates the role of sexism in the continuitng lack of gender parity.
Carter Martindale: SCOTUS Political Leanings. A look at how the Supreme Court has ruled on various issues and an attempt at predicting how the current court will vote.
Sara Park: Polity and the Social Institutions of Gender Discrimination. An analysis on the relationship between polity and social institutions of gender discrimination.
Daniel Salgado-Alvarez: When All Work Means All Play: International Tourism in Mexico. Analyzing what different tourist attractions and government interventions generate the most tourism GDP using data from all the states of Mexico
Amaya Sizer: An analysis of foundational styles for ranked UFC fighters. An analysis of the different foundational styles of ranked fighters, and how they are related to number of wins and rankings.
Monica Chang: Justice Delayed is Justice Denied: Litigating Delays in Disability Accommodation for Homeless Families. My project investigates whether disability-related needs are being met for homeless individuals within a reasonable timeframe in support of the Greater Boston Legal Services’ class action lawsuit against the MA EA Shelter system.
Nick Maxwell: Examining the Ideology of the Roberts Court. An analysis of the voting history of the Roberts Court by ideology and issue.
Alexander Park: Home Field Advantage in the NFL, MLB, and NBA. An analysis of the importance of home field advantage in America’s three largest professional sports leagues.
Esther Kim: Modern Patterns in Korean Immigration. A look at modern patterns in Korean immigration through the relationship between types of visas and national economic growth.
Nick Brinkmann: Best To Test: Analyzing My Online Chess Games. Investigates trends in my online chess games over the past 2-3 years, including rating changes and performance in different openings; attempts to predict game outcomes based upon various characteristics.
Osvaldo Cervantes: An Analysis of Gaming, Gender, and Psychology. A study into how behavioral factors such as gender identity, anxiety, or other behavioral descriptors are revealed through gaming stats and real world metrics.
Will Rowley: Starting Pitchers: A pitch by pitch analysis of the 2019 MLB season. A look at how different starting pitchers compare to one another in pitch selection, pitching under pressure, and performance statistics, with an analysis of the effect of pitch metrics on WHIP and ERA.
Yifan Chen: Analyzing Trends in College Financial Aid. My project will look at how percentages in different types of financial aid has changed over time, and how it varies depending on student characteristics.
Jasmine Hyppolite: Hashtags As Social Movements: Can They Impact Legislation? This project looks at how the popularity of hasthags for the Black Lives Matter Movement and Me Too movement correlate with the presence of bills regarding related topics in the NY Legislature to understand how impactful hashtag activism is.
Dash Chin: kanye omaRi west. An analysis of music streaming and writing trends through the lens of music by Kanye Omari West.
Sophie Bauder: The Impact of COVID-19 on Education. An analysis of how COVID-19 cases impact school reopenings, and how the pandemic as a whole effects edcators.
Loic Tagne: The Splash Brothers. Analyzing Stephen Curry and Klay Thompson’s 2015-2016 NBA Season
Michelle Kurilla: Analyzing Case Salience and Majority Opinion Assignments by Chief Justices. A look at how case salience impacts the frequency of majority opinion assignments by Chief Justices to themselves
Ang Sonam Sherpa: Gender Representation in Politics - Case Study Nepal. Analyzing the political representation in Nepal along gender lines with regressions of various demographic, developmental and gender-related characteristics in a district
Hope Kudo: How the Average American Spends Their Time. Exploring the American time usage survey and looking at how demographic and socioeconomic factors influence how people spend their time
Josh Willcox: Natural Resources and State Structure. Exploring the relationship between types of state revenue and regime type
Neloy Shome: Covid-19: Domestic Impact and International Pandemic Management. Analyzing Covid-19 impact on mental health disorders and racial minorities.
John Chua: Food Insecurity in San Francisco. My project visualizes and predicts risk factors for food insecurity in San Francisco in support of Alemany Farm’s Community Food Project Program.
Ton-Nu Nguyen-Dinh: The Dam Building Boom. An interactive map visualization of dams and reservoirs in the world and analysis of factors influencing their construction.
Joshua Berry: Economic Indicators for G20 Attitudes towards China: How Trade Flows Predict Public Opinion. Analyzing how international trade flows and macro-economic data predict individual, micro-level attitudes towards China among the G20 countries.
Kai McNamee: Racial Geography of the MBTA. Examines the racial geography of Boston’s public transit system.
Ruby Huang: NCAA D1 Women’s Volleyball 2011-2019. This project looks at the average performance of top teams and individual players in the nation across every skill category from 2011 to 2019.
Taryn O’Connor: Analysis of US Senate Elections, 1980-2018. An analysis of the relationship between Senate campaign expenditures and election outcomes.
Justin Qi: Equitable Lending? Don’t Bank on It: Using Waiting Times to Measure Racial Disparities in the Paycheck Protection Program. Exploring the potential role of taste-based discrimination in Paycheck Protection Program loans.
Jack Murphy: UNGA Regional Voting Patterns. An interactive visualisation of regional voting patterns in the UN General Assembly, allowing comparisons at both the region- and country-level.
Hollyn Torres: NFL Ticket Price Data. Analyzing the average NFL ticket price in relation to regional location and success
Megan Mackey: The Economics Of The Premier league From 2009-2015. A look at how the league finish of a team in the premier league affects the payments they receive and the consequences that come with that
Nikita Lledo: What direction are we going? Exploring data in what direction african citizens think their respective countries are going.
Matt Tynes: NBA MoneyBall. Analyzing advanced statistics and salary in the NBA
Matthew McGlone: Minesweeper!. An analysis of minesweeper stats between a pro and myself at several difficulties to compare potential record times
Andre Ferreira: The Reality of Climate Change. My project analyzes trends regarding temperature anomalies and its correlation to co2 emission growth.
Diana Zhu: Analyzing Figure Skating Scores on the International Circuit. Using World Championship Figure Skating data to explore scoring trends and identify home-town advantage.
Emily He: U.S. Teacher Characteristics Throughout the Years. How have the qualifications of U.S. teachers evolved over the years?
Drake Johnson: Protest Risk in Different States and Countries. An analysis of the likelihood of violence and/or fatality by the police during a protest in any given state or country
Jasper Goodman: The Presidential Politics of COVID-19. This project provides a framework for understanding COVID-19’s impact on the 2020 presidential race.
Ruy Martinez: COVID Cases and Deaths as a Factor of Race. How does race factor in to who gets COVID and who dies — and what does it say about how we deal with disease?
Rena Cohen: First to Close, Last to Open: The Economic Effects of COVID-19 on Arts Organizations. An exploration of the financial impacts of COVID-19 on the arts and culture sector using survey response data from over 17,000 organizations collected by Americans for the Arts
Sammy Murrell: Work-Related Fatal Injuries in New Zealand (2000-2014). An exploration of work-related fatal injuries by demographics, working circumstances and location.
Yishak Ali: Fair or Exploitative Pricing: Pharmaceutical Drug Price Trends. An analysis of recent rising drug costs in the US in the context of R&D costs to determine if firms are raising prices for R&D reasons or profit-raising motives
James Joyce: Global Terrorism and Economic Inequality. An analysis of how economic inequality affects the frequency of terror attacks around the world.
Derek Chang: U.S. Economic Growth: A Look Back and a Look Forward. An exploration of the current economic environment and public policies amidst COVID-19 that begins with an examination of America’s recent economic history.
Sofie Fella: Female Athletes, Body Image and Societal Expectations. My project investigates data on women in sport, general female body image, how female athletes are seen in media and most importantly the relationship between female athletes’ and their body image.
Rom Blanco: Urban Population and Forested Area. A study of the relationship between urban population and forested area in 227 countries and territories between 1990 and 2019.
James Wolfe: U.S. Politics and Public Opinion in the Wake of Covid. My project analyzes the state of U.S. public opinion and political sentiment in the time of coronavirus, with a focus on Presidential politics and the 2020 election.
Khalid Thomas: PGA Tour Data: How do you match up against the pros, and what does into a good golf game. An analysis of how to improve your golf game.
Ciara Duggan: Predicting World Bank Group Employees’ Overall Job Satisfaction. Which variables related to World Bank Group employees’ work and workplace environment are most predictive of their overall job satisfaction?
Hiren Lemma: A Multifaceted Analysis of Trends in Voter Data. An exploration of the extent to which various factors affect trends in voting data, based within the United States by county
Owen Asnis: The Battleground: Wisconsin, Michigan and Pennsylvania in Contemporary Presidential Elections. Studying the battleground states of Wisconsin, Michigan and Pennsylvania in contemporary presidential elections
Anthony Morales: Does Money Win You Titles in the Premier League? An analysis of how the money spent by each Premier League team each season affects their performances.
Winona Guo: Climate Colonialism. This project explores how colonial histories have impacted countries’ climate risk today.
Abigail Skalka: EU Sanctions 2002 - 2020. This project examines the people and entities subject to EU sanction in a comparative framework with the United States.
Aidan Borguet: NFL’s Best Plays. It is an analysis of the NFL’s highlight plays for each team and what their tendencies are.
Geena Kim: NYC Crime Analysis 2019-2020. An analysis of how different factors and characteristics correlate with being a crime suspect or victim in NYC.
Bobby Current: Education and Political Leaning. I look at how much, if at all, education spending by district affects voting pattern in presidential elections, with both the total spending and the spending per capita being used.
Lavinia Teodorescu: How communism affected Women Rights. My project compares data from communist, post-communist and “never-communist” countries and analyzes differences in employment, education and healthcare
Uluc Kadioglu: UFO Sightings in the United States. An analysis of the different factors that might be correlated with UFO sightings across the US.
Ana Castaner: Does Political Affiliation Influence County-Level COVID-19 Infections in the United States? An analysis of the relationship between county-level political ideology and COVID-19 case density across the United States.
Buddy Scott: Buddy Ball: Understanding NBA Finances Amidst COVID-19. Analyzing finances of NBA teams and how COVID-19 (specifically the lack of revenue and income generated by ticket sales) could affect the economics of the league.
Liam Hall: Oklahoma Demographics and Economic Status by School District and the Likelihood of a Special Olympic School Program. What is the correlation between demographic groups, economic status, and the presence of a Special Olympic School Program in the state of Oklahoma?
Sophie Li: Analyzing COVID-19’s Impact on Poverty in Sub-Saharan Africa. Analyzing COVID-19’s impact on poverty in sub-Saharan Africa
Satoshi Yanaizu: Comparison between Global Steel and Aluminium Production. My project analyses the global trend in steel and alumnium production
Reem Ali: Failed Punishment: How US Sanctions Have Impacted Development Worldwide. An investigation of how US Sanctions (1980 - 2015) have impacted developement (evaluated through the indicators of public health, education, democracy scores, imports, and exports) worldwide.
Eleanor Fitzgibbons: Supreme Court Justices Over the Years. My project explores the voting history and ideological leanings of each Supreme Court Justice from 1946-2020.
Naomi Jennings: Misogyny in Rap. A look into trends of misogynistic sentiment in rap music over the decades
Felix Deemer: How does Politics impact Income Inequality? My project investigates the link between income inequality and political institutions, and how certain political environments are correlated with greater levels of inequality.
Daiana Lilo: Analyzing the Emotional and Psychological “Tells” of a Supreme Court Justice. Looking to see if behavior is a good inidcator in predicting how a Justice will vote during the oral argument stage of a case.
Anh Ton: Analysis of Prosecution Data in Middlesex County, MA. Analyzing the Efficacy and Racial Disparities in Prosecution
Lucas Gazianis: Exploring the Ideological Tendencies of the Supreme Court. This project explores the ideological consistency of Supreme Court justices, more specifically how frequently they vote out of lockstep from the other justices appointed by presidents of the same party.
Josiah Meadows: Forces Behind Florida Voter Registrations. Determining how closely rising unemployment, COVID-19 deaths and cases, and other pivotal events in 2020 correlate with increases in FL registration numbers.
Salomé Garnier: Sexual Education and Health Outcomes for Women and Girls. My project explores the relationship between sexual education and health outcomes for women and girls, using data from the Sustainable Development Goals.
Kendrick Foster: Predicting Corruption in Latin America. What economic and political factors are most useful in predicting corruption in Latin America?
Lukas Emge: NFL Combine: Do the Results Really Matter? My project investigates and analyzes the data from the 2013 to 2017 NFL Combines, looking at which tests matter the most for different positions and what each test can tell us.
Prashanth “PK” Kumar: County-Analysis of COVID-19 by Education and Population. Here is a look at each county in the US and their COVID rates as affected by demographics including education, population, and population density.
Pierce Bausano: Harvard Wrestling Database. A database of Harvard wrestlers.
Charles Hua: Assessing the Landscape of Climate Political Fundraising. My project analyzes the landscape of climate political donations in the 2020 election, broken down by political party, political office, and state.
Janna Ramadan: Perpetuating Islamophobia in the United States: Examining the Relationship Between News, Social Media, and Hate Crimes. Exploring the role social media plays in perpetuating negative sentiments towards Muslims and the conflated Arab and South Asian communities and the relationship between hate crimes and media coverage.
Sreya Sudireddy: Effect of Social Distancing Policies on COVID-19 Outcomes. A look at how Massachusetts’ social distancing policies have affected COVID-19 outcomes.
Trisha Prabhu: Analyzing @realdonaldTrump: A Deep Dive Into Donald Trump’s Tweets. An analysis of what drives Donald Trump’s sentiment on Twitter.
Setu Mehta: Going the Social Distance in New York City. Exploring the links between social distancing violations and COVID-19 cases, deaths, and hospitalizations in New York City.
Katrina Keegan: Is Someone, Anyone, Leading Protests in Belarus? The goal of this project is to understand how the 2020 protests in Belarus are organized and led based on information from the social media platform Telegram.
Andrew Jing: Equity in Education: Effectiveness of State-Level Policies. Using specific education policies to predict the equitablility of state-wide systems.
Victoria Wang: Violence at 2020 Protests. Exploring factors that predict violence at nationwide protests in 2020.
Seth Filo: Predicting NFL Success by College,Combine, and Physical Traits. This project is a deep-dive on what traits and past production have the most predicitve power for NFL success, it studies each of the 22 offensive and defensive positions and strives to find key statistics that may be undervalued.
Noah Dasanaike: How is Democracy Changing? Measuring changes in democracy over time with predictions for the future.
Vlad Ivanchuk: Life on the Frontline: People’s Access to Services Amidst an Armed Conflict in Eastern Ukraine. Analysis of access to basic services and reliance on local government among people who live in the midst of the military conflict in eastern Ukraine.
Zelin Liu: Elephants, Donkeys, Doves, and Hawks: Predicting U.S. Treaties. Examines the impacts of political party and military spending on how many treaties the Senate will receive in a given year.
Sophie Webster: HUDS Feedback Analysis. This project analyzes 2500 text messages of feedback sent to Harvard University Dining Services (HUDS) by students in 2019.
Andy Wang: Contributions In and Out of the Classroom: Harvard Faculty Political Spending. This project looks at political spending behavior of Harvard faculty since the 2016 election, as well as modeling for variables such as age, gender, and school
Jeremiah Kim: Harvard 2023 Social Connections. We analyzed social connections across the Harvard class of 2023 with respect to demographics, campus geography, and extracurriculars. Other members of the group: Helen Pang, Jack Kelly, Emily Ni, Kelsey Wu and Mark Stephens.
Jamal Nimer: Harvard Housing Project. We randomly assign blocking groups to one of the twelve houses and compare the distribution of identifiers (i.e., ethnicity, varsity athletes, legacy) to the actual distribution at the college. We also study patterns in blocking group formation. Other members of the group: Eliot Min, Lucy He, Austin Li, Sam Saba, Carina Peng, Angie Shin, Shojeh Liu and Ilyas Mardin.
Lainey Newman: Union Density & Democratic Vote Share in the U.S. 1976-2016. My final project for Gov 1005 looks at the relationship between union membership density and Democratic vote share in presidential elections.
Wyatt Hurt: Transboundary Water Conflict. An exploratory study that interrogates qualitative research claims about the causes of transboundary water conflict.
Jessica Edwards: Investigating Demographics within Higher Education Engineering Programs. This project analyzes racial and gender demographics for highly ranked US engineering programs within colleges and universities.
Paolo Pasco: Thinking Inside the Box: Analyzing Crosswords. Exploring the New York Times and Los Angeles Times crosswords for patterns, trends in difficulty, and changes to language over time.
Lindsey Greenhill: Human Trafficking and Exploitation. An analysis of the Counter Traffickign Data Collaborative’s publicly available data focusing on the demographics and movements of trafficked victims across the world.
Luke Kolar: Discovering ‘Discovery’. Examining samples, remixes, and covers of the 14 songs on Daft Punk’s Discovery, and exploring potential connections between use volume and song qualities.
Vivian Zhang: EU Remittance Flows. A visualization tool to analyze EU remittance inflows and outflows by country and by year from 2000 to 2018.
Jessica Wu: In the Time of Coronavirus. This project explores the state of life, love, and sorrow in the time of coronavirus and social distancing, from the rise of boredom and the deterioration of relationships to the manifestation of socioeconomic inequality.
Elias DeLeon: This Is Magic. Looking at what makes a Magic: The Gathering card “good,” specifically tournament viability and selling price on online marketplaces.
Liz Hoveland: Effect of Absentee Voting on Future Electoral Participation. A study looking at the effect first-time voting type has on future voting behaviors.
George Dalianis: Social Spending Programs in OECD Member Countries. Examines the effect of social spending programs on GDP per capita, economic inequality, life expectancy, and other social/economic variables in OECD member nations.
Saul Soto: A Decade of Dance. A comparison between the UK and Germany Top 100 Charts from 2000 - 2010.
Yong Lee: Effect of Attendance Policy on Synchronous Online Lecture Attendance. Survival analysis examining differential levels of attrition for synchronous lecture attendance in online classrooms.
Jason Yoo: Is Chess Solvable? Looking at opening moves in chess and various other factors to determine their correlative effects on win percentages.
Nidal M.: Urgent Fury: Will Your Coup d’Etat Succeed? A project that predicts the probability of coup success.
Alexandra Ubalijoro: Impact of Demographic Factors on Obesity Rates in the US. A project that looks at obesity rates in the US and how these are affected by social factors such as food access and income.
Michael Chen: Impact of Varying College Characteristics on Innovation Rates. This project is a comprehensive analysis on Opportunity Insights data on patent rates by college, as well as data on college characteristics from the College Scorecard Project.
Westley Cook: Payroll and Performance: Does Money Buy Wins? An analysis of the relationship between payroll and performance using data from the NBA and MLB, demonstrating a moderately strong relationship between spending and success for most teams except the New York Knicks (surprising no one).
Jad Maayah: Women and Religion in the Middle East. This project analyzes public opinion behind policies and cultural norms that promote gender equality in the Middle East and explores the relationship between discriminatory values and respect for religious authority.
Michael Wu: Unemployment in the Era of Covid-19. Examing the effect of the coronavirus pandemic on US weekly unemployment claims across all fifty states.
Amy Zhou: U.S. Case Law Over the Years. Examines US officially published case law and the salience of various topics. Takes a special look at gender and the Supreme Court.
David Sutton: Switchers: Analyzing the Relationships Between Vote-Switching & Demographics, Policy Positions and Public Opinion. This project investigates switchers - voters who cast their ballot for a major party’s candidate in one election and then cast their ballot for the other major party’s candidate in the following election (same office).
Richard Zhu: Oscars So Local?: Film Awards by Demographics and Geography. Analysis of the Academy awards versus other film awards in terms of geography, demographics, and popularity over time.
Benjamin Villa: The Sacred and the Profane: Social, Religious & Medical Effects on Emotional Wellbeing. The following project attempts to see how different social, economic medical, and religious factors have an effect on the mental health of 89 Metropolitan and Micropolitan Statistical Areas in the United States in the Year 2016.
Arnav Srivastava: Global Health Spending Trends. Exploring relationships between health spending and factors such as income and demographics for various countries, helping us better understand the similarities and differences in health spending across our global community.
Cassidy Bargell: Sport Perceptions. This project looks at how sports are searched for and talked about on the internet, specifically related to concussions and other injuries.
Scott Mahon: Effect of MLB Statistics on Team Winning Likelihood. A look at how different MLB statistics, such as runs, batting average, etc. impact a team’s total number of wins at the end of the season.
Raymond Hu: Online Grocer Case Study: Instacart. Analysis of Instacart shopping trends and a comparison with traditional brick and mortar grocery stores
Rachel Phan: Obesity and Food Insecurity in the US. This project analyzes how food insecurity might affect obesity rates in the US and across states. It also takes a look at how poverty and other demographics affect both of these factors.
Elias Abu Nuwara: The War on Drugs: Let the Numbers Speak. An analysis of data on the US War on Drugs. Examining how increasing the intensity of drug law enforcement could yield counterproductive results.
Linda Qin: The Language of Emojis 🤠. Analyzing the most important question of our generation: which emojis do Harvard students prefer?
Belinda Hu: How Startups Can Thrive in the US. An analysis of different factors that affect how startups operate in the US, like type of industry, location, and education in the area.
Karen Jiang: Accountable Care Organizations in Medicare Shared Savings Program. This project looks at the generated savings or losses from ACOs for Medicare based on ACO, patient, and provider factors.
James Hutt: Reshaping the United States. States are increasingly defined by their composition, rather than their geography - this project remaps the US based on flight paths, migration, and political, religious and racial homogeneity. Finally, the East and West Coast can be together, like they always wanted.
John Morse: HIV and the Potential of PrEP. A look at HIV diagnoses across the U.S., and the potential cost savings offered by PrEP.
Katherine Wang: Schools as Social Mirrors. This project analyzes the extent to which a school’s inequality mirrors that of its community’s
Julia Englebert: Agriculture and Education in the Midwest, 1870-1960. This project uses census data and mapping to explore the history of agriculture in the Midwestern United States, paying particular attention to its relationship with education.
Hannah Phan: Effect of Public Infrastructure on Boston Crime Levels. Analysis of how public resources and amenities such as Bluebikes, streetlights, and trees influence crime levels in Boston from 2015 to present.
Connor Riordan: How Health, Income and Education Affect Voting. A look at how three separate variables, health, income and education, affect voting in the 2016 presidential election.
Hudson Miller: College and Military Demographic Comparison. Comparing the demographic breakdowns of members US military and undergraduate students.
Paddy Adams: Brexit Voting Demographics. Understanding the basis for demographic stereotypes following the 2016 Brexit referendum by analysing national census data and the correlations to Brexit voting.
Hamaad Mehal: NBA Fine Data. Looking to see if NBA fines work equitably and efficiently in changing certain behaviors in players.
Suruchi Ramanujan: Opioid Trends Across the United States. This project uses data from the CDC and chapter55.digital.mass.gov to examine trends in opioid death and treatment in the United States.
Hamid Khan: Cricket Analytics. Analyzing how changes in the rules of ODI cricket have impacted run-scoring and wicket taking
Leena Ambady: Organ Donor Registration and Demographic Trends in New York State. Looking at how demographic factors like age, income, and race might affect organ donation registration rates in New York state counties
Lara Teich: Winning Early in Curling. Looking at the influence of winning the first end on winning the full game in curling for Olympic curlers and College curlers.
Jerrica Li: Coronavirus Up Close. I am creating visuals and maps for understanding the scope of underreported confirmed cases and the magnitude of COVID-19 in the US.
Matej Cerman: Educational Inequality in Slovakia. Exploring the ties between socioeconomic conditions and unequal educational outcomes among Slovak regions.
Ella Michaels: Goodwill Hunting. Looking at goodwill locations throughout the US, searching by and understanding their relationship to neighborhood housing prices.
Rebecca Xi: The Covid-19 Data Project. A group project to analyze the spread of COVID-19, its economic impact, and the efficacy of government policy in mitigating the crisis. Other members of the group: Jun-Yong Kim, Katelyn Li and Nishu Lahoti.
Emma Freeman: Diversity of Upper Level Educational Institutions. Exploring factors, such as female population or admissions rate, that may contribute to racial diversity in upper level educational institutions.
Chase Souder: Modernization in Drum Corps International. Investigating the trends in musical selections of Drum Corps International World Class Finalists, as well as potential correlations between the modernity of a repertoire and score/placement.
Jacob Hansen: Is Arizona More Blue Because It’s Less White? Examining whether changes to Arizona’s racial demographics, particularly increasing Hispanic/Latinx populations, are correlated with increased Democratic vote share.
Grace Zhang: Harvard General Education Courses. This project aims to understand trends in general education courses at Harvard and their implications on general education course enrollment caps.
Josh Mathews: NBA All Star Game (1951-2020): Does Player Popularity Impact Performance? This project investigates the effects of popularity on minutes granted to players in the All Star game and their performance while also allowing for visualization of player metrics and shot charts over time.
Owen Bernstein: Content Analysis of Presidential Speeches. Identifying and quantifying the usage of populist, immigrant related, conservative, progressive, and environment related language in presidential campaign speeches in the United States.
Ishan Bhatt: Gender’s Effect on Speaker Points in National Circuit Lincoln-Douglas Debate. Using pretty much every national tournament from the past three years, I investigated to see if there’s a gender bias in the amount of speaker points judges assign to debaters.
Cameron Reaves: Predicted Net Migration from Sea Level Rise by 2100 for US Counties. This project visualizes a dataset containing predicted net migration from sea level rise by 2100 for US counties
Jason Rose: Harvard College Courses. In this project, I visualize trends in Harvard’s academic offerings and course enrollment.
Daniela Teran: Gender Equality and the Informal Economy: An analysis of the Andean Community Countries. This project aims to visualize the correlation between the percentage of women working in the informal economy and gender inequality in the Andean Community countries.
Kiera O’Brien: Climate Policy & Public Opinion. I analyze a common misconception regarding American politics and the climate challenge: that Republicans are opposed to action of any sort.
Taylor Greenberg Goldy: Understanding E-commerce Analytics and Purchase Behaviors. This project aims to look at different purchasing behaviors of customers on an e-commerce website and predict and suggest what other products they may also want to buy.
Teddy Landis: Effect of First-Time Voting Method on Future Participation. We look at first-time voters in North Carolina and see if voting method (absentee ballot vs. in-person) has an effect on future participation in elections.
Brian Kim: Korean Parliamentary Elections from Democratization (1987) to 2016. This project looks at election results data from democratization in 1987 to 2016 to see nationwide PR voting behavior, regional voting behavior (based on single-member districts) and any relationship between senior population and vote share by ideology.
Jenna Moustafa: Common Core Standards and Racial Education Achievement Gaps. This project assesses the impact of the implementation of the Common Core Standards on narrowing the educational achievement gap for racial minorities.
Tivas Gupta: Interactions Between Inequality, Populism, Social Spending, and Happiness. This product looks at the relationship between inequality and populism, social spending, and happiness throughout countries worldwide.
Will Schrepferman: State of the Union Text Analysis. This project applies textual data science techniques- including sentiment analysis, topic modeling, and neural network-powered Natural Language Generation- to State of the Union addresses!
Arushi Saxena: COVID-19: Early Public Sentiment about Social Distancing. The purpose of the project is to understand public sentiment about social distancing via Twitter data, across the United States, and correlate it to the extent to which US states are effectively social distancing.
Diassa Diakité: College Football: Win Regression Model. A dive into which statistics in college football over the past ten seasons correlate best to success within a given conference, defined as conference win percentage, and throughout most of Division 1 FBS football, defined as total win percentage.
Thomas Weiss: Moral Hazard Among Lawyer-Legislators. This study examines whether the professional background as an attorney of many U.S. politicians has an influence on their voting behaviour.
Mak Famulari: FBI’s Top Ten Most Wanted Fugitives. This project investigates the FBI’s Top Ten Most Wanted Fugitive list since its inception; it consists of a breakdown of criminals and their crimes, as well as considerations of special cases.
James Bikales: Demographics of EV Charging Station Placement. This project examines more than 28,000 EV charging stations in the U.S., comparing their locations to Census demographics, such as median household income, of the county in which they are located.
Andrew Courtney: Trends from the 2018 Midterms in CA-48. I study the support for candidates in California’s 48th Congressional District during the 2018 Midterm elections by demographic breakdown.
Brendan Chapuis: How Accurate Are Prediction Markets?. This project examines the accuracy of prediction markets as compared with polls in the context of the 2016 and 2020 Democratic primary elections.
Stephanie Cheng: Deep Dive into the Design Census. This project investigates the demographic of the US design industry and provides a predictor for job satisfaction and salary by a few metrics of interest.
Grace Pan: Need A Good Date Spot?: Analyzing Yelp Restaurant Reviews. This project uses a geographically diverse subset of Yelp data to analyze what type of restaurants make good date spots, the results of which are useful not only for individuals trying to impress their dates, but also for professionals in the restaurant business looking to attract more customers.
Julius Gunnemann: Berlin housing: A diff-in-diff assessment. Berlin has intervened in the apartment rental market in an unprecedented way, demanding landlords to cut prices to 2010 levels. Can this work?
Gabe Cederberg: Partisanship and COVID-19 Shelter-In-Place Adherence. Partisan leaning significantly impacts the degree to which people follow COVID-19 shelter-in-place orders.
Tahmid Ahmed: Weather Effects on NY Giants. This project focuses on the effects of weather on the NY Giant’s performance and attendance using regression modeling.
Asmer Asrar Safi: Government Censorship on Twitter: Requests and Response (2012-2019). This project tracks content withdrawal requests made by Governments on Twitter and Twitter’s response, in relation to a country’s GDP and overall Freedom Score.
Jonah Fried: Professor Ratings. My project is an analysis of the effect of different factors on the rating of professors at a variety of colleges.
Henry Austin: Visualizing Wealth Inequality in Michigan. This project analyzes at the geographic movement of wealth across Michigan over the past several decades and its impact on educational attainment, as well as the impacts of past policies on the state we see today.
Ibraheem Khan: Chicago Public Schools. The project explores the interplay between Chicago Public Schools’ budget deficits, school closures, and overall enrollment
Mohamed Mabizari: College Football: Win Regression Model. A dive into which statistics in college football over the past ten seasons correlate best to success within a given conference, defined as conference win percentage, and throughout most of Division 1 FBS football, defined as total win percentage.
John Mark Ozaeta: Worldwide Correlates to Happiness. An exploration of how different variables affect the happiness of countries around the world.
Shreyvardhan Sharma: Mapping COVID-19: Outbreak, Second Order Effects, and Public Opinion Analysis. An analysis of COVID-19 involving mapping the spread of the coronavirus across the U.S, measuring the second order effects of the disease, and analysing the change in public opinion over the course of the disease.
Julian Habermann: Immigration Ineligibilities. An analysis of immigrant ineligibilities that hopes to find the common reasons why immigrants are denied.
Micah Williams: Visualizing Covid-19 in the United States. Visualizing the spread of Covid-19 In the United States with graphs and animations.
Naina Tejani: It’s not a Myth: Climate Change and its Related Factors. This project traces the rise in global temperature over the past century and looks at factors that might be responsible.
Garrett Rolph: Students in Schools. I compared factors such as school expenditures, average teacher salaries, and school demographics to explore the relationship these variables have with standardized test scores (both SAT and ACT).
Yanghe Liu: COVID-19 in the World. This project explores visualizations of COVID-19 in the world in multiple perspectives, including but not limited to map and histogram.
Fatma Al-Alawi: The Greenest Building. An exploration of how much energy is embodied in our built environment, and what it means to demolish existing buildings rather than reuse them.
Tate Huffman: MLB Pitch Sequencing. An exploration and analysis of pitch sequencing and its effects from 2015 through 2019 in Major League Baseball.
Rachel Auslander: Investigating Mainstream News Coverage of Presidential Candidates. An exploration of whether news source choice influences the likelihood to vote for a particular candidate.
Chase Bookin: High School Players and the Major League Baseball Draft. An analysis of the decision made by top high school baseball players to sign in the Major League Baseball draft.
Adelson Aguasvivas: Effects of Redlining in New York City. A study looking at the effect of redlining in New York City from 1937 to 1940, mainly affecting communities of color, their median household income and housing value from 1950 to 2010.
Kayla Manning: Undergraduate HUDS Traffic Patterns in 2017-2018 and 2018-2019. This project analyzes data on swipe counts for every day, location, and meal for each of the undergraduate HUDS facilities over the 2017-2018 and 2018-2019 academic years.
Yao Yu: Gun Violence Decrease in San Francisco and Oakland. A study looking at why gun violence in San Francisco and Oakland decreased from 2013 to 2017 while it increased in other US cities during that time.
Sydney Sorkin: NCAA on Twitter: Does the NCAA tweet about Men’s and Women’s sports differently?. This project analyzes the tendency of NCAA affiliated Twitter acounts to tweet about male or female sports and athletes.
Billy Koech: Kibuon Project Data Analysis. Determining an optimal well location and water usage trends for a community in the southwest region of Kenya.
Bridger Gordon: Social Connectedness in America. Analysis of Facebook’s Social Connectedness Index shows that the geographic closeness is a significant factor in who we know / interact with, which is one possible explanation for demographic similarity in social circles.
Molly Chiang: New York City Airbnb and Housing Prices. In general housing prices and Airbnb prices are very slightly positively correlated, but there is lots of variation within and between boroughs.
Katherine Enright: Public Opinion Surrounding the 2019 Hong Kong Protests. This project analyses underlying public opinion factors behind the Hong Kong pro-Democracy protest movement as well as exploring a Twitter-based Chinese government operation to influence international opinion about the protests.
Carine Hajjar, Diego Arias and Johannes Lang: Harvard Friend Group Selection Study. We analyzed how different characteristics influence Harvard students’ choices of friend groups.
Aysha Emmerson: Project Resilience. The Project downloads tweets pertaining to “resilience,” performing a series of analysis—including a sentiment analysis and word cloud plot—to investigate what feelings, concepts, and words, the general public associates with this concept.
Michelle Gao: Performance of Political Ads on Google. I look at the relationship between Democratic presidential candidates’ Google political ad spending and their primary polling results over 2019.
Mari Jones: Race & Gender Implications of the Criminal Justice System. Analyzed race and gender associations within the criminal and youth incarceration systems.
Mengxi Tan: US Immigration Explorer. This website examines immigration into US, and answers the two following questions: 1. Where are the immigrants coming from, and through which channels are they admitted? 2. Once the immigrants are in the US, how well are they fitting into the society?
Mitsue Guerrero: Water stress in Mexico City. Given the large population and limited water resources available in Mexico City, we visualize the consumption at block level to identifiy the biggest consumers and observe the water inequality gap that is making the city run out of water by 2050.
Lewis Zou: Predicting the Results of LoL Games. I analyze the 2018 League of Legends World Championships and find what factors are most important in determining the outcome of a game.
Amal Abdi: Exploring Evictions and Rent Burden in Ohio. I look at county-level evictions data throughout Ohio and rent burden by race and ‘ruralness’.
Miroslav Bergam: Donations of Harvard-Employed Individuals to 2020 Presidential Campaigns. Elizabeth Warren recieves the most support among Harvard-employed individuals by most measures, and other facts like overall campaign size, political ideology, employer, and occupation are correlated to the size of the donation and the campaign being donated to.
Feven Yohannes: Ethiopia Economic Development. In this project, I’m using data from the Worldbank and The UN that shows the social and economic changes that have been occurring in Ethiopia in the last 60 years.
Alexandra Norris: Visualizing Migration. I visualize migration data, looking at where refugees are coming from, where they are going, and the relationship between the number of refugees entering a country and different GDP indicators.
Cian Stryker: China and the Belt and Road Initiative. This website is an introduction to China’s Belt and Road Initiative, which is one of most important geopolitical phenomena of the 21st century.
Hanif Wicaksono: Cambridge Energy Use. A visualization tool to understand building makeups and how City of Cambridge uses energy.
Jackson Kehoe: Orchids Around the World. A multi-faceted approach to better understanding the global distribution and trade of orchids.
Daniel Shapiro: Russian Regional Demographic Change. This project breaks down Russian demographic data by region and analyzes trends and patterns over time.
Sanjana Ramrajvel: Homelessness in the U.S.. This project seeks to examine how well our country meets the shelter needs of its homeless population.
George Guarnieri: The Harvard Shop Sales. My project performs an analysis of sales data from the Harvard Shop, specifically focusing on web sales.
Jessica Scazzero: Is Cash Here to Stay? An Exploration of the Factors that Drive Individual Cash Use. This project used the Fed Consumer Payment Diary surveys to analyze individual cash usage by individual level characteristics, transaction level characteristics and time variables.
Chloe Shawah: Fingerprints of Colonization. The project seeks to determine if there are effects of foreign colonization/occupation traceable today by tracking indicators of the economic prosperity, health, and education in world nations over time.
Grace Rotondo: Fixing the Flaws of Networking: An Alumni Directory of the Harvard Women’s Lacrosse Program. This is an alumni directory of the Harvard Women’s Lacrosse Program, a platform for Harvard Women’s Lacrosse affiliates to easily access accurate alumni information.
Joshua Pan: Dunk on Some Stats. This project analyzes trends in professional and college basketball and comes up with models, focusing primarily on player positions/roles.
Angela Fu: United Nations Resolutions. The project analyzes UN resolutions that were voted upon dating back to 1986.
Emily Axelsen: More Permits More Problems? Tracing Factors Correlated to Gun Violence. I analyzed trends in the number of permits granted and the resulting number of gun violence incidents and noticed that the per capita number of gun violence incidents is similar among states that have gun violence policies (such as required gun registration and waiting periods) and states that do not.
Katie Cao: The Billboard Top 100: An Analysis of Timelessness and Lyrical Content. How does the lyrical content of songs predict their timelessness?
Diego Flores: Just How Great (Truly) is Democracy?. This project aims to determine whether or not Democracy is truly the superior form of government by assessing potential relationships between its implementation and variables associated with gauging the prosperity of a society.
Sophia Zheng: Housing Prices in Three Major US Cities. This project aims to compare historical housing data over the past thirty years in New York, San Francisco, and Seattle in order to see correlations with income, and study distribution by zip code.
Dominic Skinnion: Whom Does the Electoral College Benefit?. The electoral college appears to benefit Republicans more than Democrats.
Rick Brown: Football Wins and College Applications. I analyze the change in applications to colleges based on the change in wins of each college’s football team and find a very weak correlation between the two.
Mariah Dimalaluan: Can you make it to next year’s Billboard Hot 100 Chart?. This project attempted to study what song characteristics – especially the key it was written in – make it likely to have a higher ranking on the Billboard Hot 100 chart.
Jeremy Ornstein: American Creative Class. How do members of the creative class – artists and engineers – relate to populations and incomes of american counties?
Rucha Joshi: Analysis of Seasons and Characters of The Office. This project analyzes information about the characters’ frequency and emotions during the show.
Elizabeth Pachus: Firearms and Sucide in America. This project investigates the correlation between suicide rate and firearm death rates while also exploring which groups of people are being primarily affected.
Abrar Trabulsi: Outlining the Relationshop Between Regime and Economic Development. Exploring the relationship between regime and economic development, especially with regards to autocratic and democratic regimes.
Andy Price: MLB Pitcher Raw ‘Stuff’. This project predicts pitch outcomes based only on a pitch’s velocity and movement.
Hannah Valencia: Analysis of Queen’s Music. This website takes a look at Queen’s music and analyzes the audio features in their 15 studio albums.
Pieter Quinton: Housing Market. An examination of the housing market, starting at the national level and narrowing the scope all the way to just one city.
Bernadette Stadler: Equal Work, Equal Pay? The State of Women’s Soccer in 2019. This project allows users to explore the numbers behind the USWNT discrimination lawsuit against U.S. soccer, as well as to look at the state of gender equality in professional soccer world wide.
Hoda Abdalla: The Effect of Media on Presidential Primary Candidate Performance. I look at the relationship between mainstream media mentions for major 2020 Democratic Presidential Candidates and their performance outcomes, measured in polling percentage and betting prices.
Cristopher Patvakanian: Armenian Diaspora Project. This project showcases the different Armenian Diaspora communities in the world and provides information with regards to their locations and sizes.
Alexander Klueber: An Investment in the Past. Carbon footprint of the Harvard Endowment and break-down to individual students.
Ryan Graff: NBA Statistical Trade Machine. My project calculates the average statistics of NBA draft picks and combines them with current player data to create an NBA trade/Value comparison machine for the user, while also performing regression on draft picks and their advanced stats.
Amanda Su: Trends in International Student Choices and Motivations in the U.S.. This project looks to analyze various patterns in the experiences and motivations of international students in the United States, specifically examining country of origin indicators, fields of study, and funding sources.
Camila Sanmiguel: The Violence Crisis in Mexico: Public Perceptions of Safety Alongside Border Apprehensions. Visualizing the spikes and falls in violent crimes in different Mexican states allows us to track the movement of the Mexican drug war; this project also examines the Mexican public’s perceptions of general danger beside Border Patrol apprehensions along the southern border.
Harrison Burke: Exploring Olympic Rowing Success. I look at various factors that influence performance at the olympics.
Jake Schonberger: Unclaimed Property. This project focused on diving into datasets related to “Unclaimed Property, specifically Californias unclaimed property division.
Victor Chen: Cincinnati Bengals 2018 Season Analysis. I look at play-by-play data for the Cincinnati Bengals in 2018 to find key drivers of expected points and win probability.
Anan Hafez: An Analysis of 3-Pointers in the NBA. Professional basketball is obsessed with the 3-pointer, how did we get here and what has changed because of it?
Oren Rimon Or: Inventors in the US. This project explores the relationship between economic mobility, income distribution and invention rate in the US.
Sam Lowry: The Effects of IMF Structural Adjustment Programs on Angola. Using World Bank and WHO data, I analayzed the effects of IMF loans on Angola in order to better understand how the IMF influences developing countries.
Chelsea Marlborough: Spotify Top Tracks Chart. This project analyzes trends in audio features found throughout Spotify’s Top 100 Chart of 2018.
Kevin Wang: United Nations General Assembly Voting Patterns. This project analyzes the frequency at which different countries vote in the majority in the UN General Assembly.
Liz Masten: Fatalities in Yemen’s Civil War. This project attempts to make sense of the chaos of Yemen’s civil war through mapping and analyzing instances, methods, and attributability of attacks.
Drake Deuel: Strava Leaderboards. This project looks at the relationship between Strava KOM ranking and climbing time.
Olly Gill: The History of The Olympic Games. I looked at data from the past 120 years of Olympic History in order to learn more about what has both changed and stayed the same for the Games and the athletes that make them so special.
Gayatri Balasubramanian: Distribution of Ethnicity and Industries in Indiana. Ethnicities tend to collect in pockets across the state, and this project overlays industries and ethnicities to see if perhaps one is more closely located near the other.
Ali Crump: Visualizing NHL Statistics. Analyzing NHL statistics which date all the way back to 1917.
Morgan Booker: Catching Criminals: A Criminal Minds Analysis. An in-depth analysis of the creative elements of first five seasons of Criminal Minds and the twisted criminals the team chases.
Madeleine Snow: It’s A Hit: Tony Awards for Best Plays and Musicals. This project examines Tony Award-nominated and Tony-Award winning Broadway plays and musicals, 1948-2019.
Isheka Agarwal: Perspective on Historical and Future Consequences of Climate Change. I analyzed the historical consequences of climate change, future projection for consequences of climate change and opinions of people about climate change living in various regions in the United States.
Grace Kim: Believers in the Divine: The Religions of South Korea. This project explores the various religious groups in South Korea, more specifically the relationship between the rise of Christianity compared to Buddhism.
Togo Kida: An Analysis of Creative Class in the United States. Analyzed the socioeconomic status of designers and creatives in the United States.
Cade Knox: NFL Big Data Bowl. This project aims to create a model that will predict how many yards per carry will happen in an NFL run play.
Prachi Naik: Making the Case for Investment in School-Based Mental Health. I analyze 2016 School Survey on Crime and Sfaety to understand the circumstances surrounding schools’ ability to provide mental health service.
Emmanuel A. Calivo: Income and Transit Access in the San Francisco Bay Area. An analysis of the relationship between household income and access to public transportation in the SF Bay Area.
Parker Mas: Behind Billboard: Exploring the Audio Features of Pop Music. Analyzes Billboard Hot 100 chart data and song audio features in order to provide interesting data visualizations and model peak song popularity.
Margaret Butler: Gothic Literature and Monstrosity. I attempted to look at monstrosity in five different classic gothic lit books through word analysis.
Minjue Wu: Analysis of novel fly model for X-linked Dystonia Parkinsonism. A review of a novel animal model used to simulate a neurodegenerative disease, analyzing relationships between knockdown gene pathway, viability, sex, and recovery for potential recommendations in future research
Amy Tan: Academic Achievement in the U.S.. This project looks at various socioeconomic covariates’ correlations with academic achievement across the U.S. as measured by standardized test scores.
Sophia Freuden: Growing Pains in Portland: A Story of Crime, Unemployment, and Population. An exploration of crime data, unemployment, and population growth in Portland, Oregon over the last ten years.
Erin Guetzloe: Boston Gun Violence. Considering that Massachusetts has some of the most stringent restrictions on gun ownership in the United States, why is gun violence on the rise in Boston?
Elizabeth Guo: Demographics and Votes of the U.S. Supreme Court. Evaluating model accuracy indicates that demographics and party affiliations of Supreme Court justices cannot be used to predict their vote patterns, and that justices are not “politicians in robes” who always vote in party line.
Shivani Aggarwal: How Couples Meet. Visualizing the ways in which different kinds of U.S. couples meet and enter into relationships.
Neil Khurana: Harvard Dining. Archiving Harvard menus and exploring variations and repititon in meal choices.
Dasha Metropolitansky: First-Year Blocking Group Project. Harvard says it fosters a diverse community; trends in students’ housing indicate otherwise. This was a group project. The other group members were: Adiya Abdilkhay, Ilkin Bayramli, April Chen, Alistair Gluck, Christopher Milne, Neil Schrage and Stephanie Yao.
Christopher Onesti: Course Enrollment Statistics. This project presents an inside look and trend visualization regarding fall and spring undergraduate course enrollment data at Harvard.
Margaret Sun: Beyond The Stage. Various insights into the music group BTS.
Ruoqi Zhang: Settling the Dust: Censorship & Environmental Activism in China, 2012. What does social media data tell us about environmental awareness and censorship in China, 2012?
Hemanth Bharatha Chakravarthy: Twitter in the Biggest Elections in the World. Sentiment analysis of the biggest Twitter election campaign in the world and breaking down the twitter farms’ role in it.
Evelyn Cai: Survivor. Outwitting, outplaying, and outlasting: What does it take to become the Sole Survivor?
Sabrina Chok: US Federal Crime Data.
Simone Chu: Presidential Speeches. Text analysis of inaugural addresses, State of the Union Speeches, and news conferences.
Celia Concannon: Tesla Stock and Elon Musk Tweets. Interactive plot showing Tesla stock volume and tweets, table to search Elon Musk tweets by date or keyword, and an about the app tab.
Andres de Loera-Brust: Exploring the Medicare Shared Savings Program. Explore the details of Medicare’s experimental new way of paying for healthcare.
Alexandra Dobbins: Lyme Disease in the United States: a Historical Perspective. Awareness of Lyme Disease as a debilitating illness has increased in recent years – take a look at where the most cases are, and how they’ve changed over time.
Nicholas Dow: Muller Report Text analysis. Creates better way to look for important information in the 448 page Mueller report
Tanner Gildea: 2020 Democratic Candidates’ Tweets. More than 20 Democrats are running for president in 2020. But how are they using Twitter to do so?
Debi Gonzalez: The Sunshine State Turns Purple on Election Day. Visual representation of Florida’s political distribution over time accompanied by county-level demographic data.
Tate Green: Game of Thrones Analysis. In depth analysis of Game of Thrones seasons 1-7
Benjamin Hoffner-Brodsky: Asylum Seekers. Tracking which countries are most likely to accept asylum applications, based on which countries they’re coming from
Jefferey Huang: Money, Efficiency, and Education: California High Schools. Analyzing efficiency and college-readiness test (SAT, ACT, AP) outcomes in California school districts.
Taehwan Kim: An Analysis of Crime in Chicago. Visualizing Crime in Chicago over the 10 year period from 2008-2018
Andrea Lamas-Nino: Crimson Analytics.* Understand engagement with the Crimson’s content.
Jennifer Li: New York Apartment Hunt. Price Analysis of 2-Bedroom Rental Units in Manhattan between 2000 and 2019
Diego Martinez: Baseball Aging Curves. Analyzing age’s effect on performance of MLB players.
Beau Meche: Census: Population Mobility Post-Trump. Looking at apparent movement / growth of the US population in Trump’s first year in office. Did young people with degrees change location?
Igor Morzan: Corruption in Latin America. Visualizing corruption across all Latin American countries and their institutions.
Seeam Noor: Seeam & EPL. Get interesting stats on your favorite English Premier League teams from the last decade
Shafi Rubbani: Organ Donations and Transplants. Different countries have different trends in donation and transplantation rates over time.
Albert Shin: Chicago Ride Share Comparisons. Does Christmas affect ride-share and rider behavior?
Mike Silva: Harvard Football Defense Analysis. Analyzing the 2018 season for the Harvard Football Defense
William Smiles: POTUS & PRICING: How Trump’s Tweets Affect Intraday Trading. President Trump’s tweets and their effects on financial markets
Céline Vendler: Zillow Data Explorer. Using data from Zillow over the past 20+ years, this app visualizes historical and forecasted trends in real estate to compete with Redfin’s data visualizations.
Henry Zhu: Newark Airport Flight Destinations. Visualizing where flights from Newark are headed and delay patterns
Kemi Akenzua: Analysis of death row executions in Texas with an emphasis on sentiment analysis
Ghada Amer: Mapping of global armed conflict post-Cold War
Ryan Michael Antonellis: Analysis of the most highly represented colleges in the NFL draft over the past 10 years.
Esteban Arellano: Analysis of levels of upward economic mobility achieved by race and gender per county.
John Ball: Analysis of rates of mental illness among comedians.
Rana Chandra Bansal: Analysis of Indian exports between 2014 and 2017.
Katherine Elizabeth Binney: Analyzing Massachusetts public school quality.
Charlie Chatman Booker: Analyzing apartment complex data in the city of Houston
Enxhi Buxheli: Russian tweets and their effect on the 2016 US Presidential Election.
Michael Calabro: Analysis of Batting and Pitching data in the MLB in relation to a drastic rise in strikeouts
Cayanne Chachati: Analysis of the deaths over the course of the Syrian Civil War
Maddie Chai: Analysis of the decline in American marriage from 1960-2012.
Holly Jaime Christensen: Analysis of NYC rental unit data in Manhattan – looking at # of units, median asking price, and housing violations
Oliver E Cordeiro: Analysis of strokes gained data from the PGA Tour in 2018
Sofía Corzo: Making a clear descriptive interface to generate data on post-conflict transitional justice mechanisms from 1946-2006.
Cunhonghu Ding: Which college in the United States offers you the best chance to climb the ladder?
Donovan Mac Doyle: I look at NFL gambling data from 1979-2017
Robert Drysdale: I project Gordan Hayward’s statistics if he wasn’t injured in 2017
Annika Engstrom: Analysis of social liberalism across US demographics, with a focus on gay rights issues.
Steven Espinoza: Looking at data concerning sanitary violations near Harvard Square
Grant Fairbairn: Analysis of Tiger Woods PGA Tour data for 2018 season
Maclaine Fields: Analysis of Harvard Women’s Volleyball 2018 Season
Charles Elliot Flood: Do Bilateral FTAs actually help boost trade?
Claire Fridkin: Nicolas Cage movie analysis and sentiment analysis
Melissa Gayton: Analysis of intake interview data for Access to Justice Lab’s Divorce Study
Peter George: Visualization of strokes gained by player per round across several 2018 PGA Tour tournaments
Hannah Elizabeth Hardenbergh: How often do artworks move at the Harvard Art Museum?
Matti Harrison: Analysis of population and it’s affect on home prices in LA and CT
Stone Alexander Nicholas Hart: Analysis of Pokemon statistics spread over generations
Hannah Ella Hess: What is the Impact of Ramadan on Pornography Consumption?
Claire Hotchkin: Analyzing Boston Marathon race times
Sean Hughes: Why are children’s chance of earning more than their parents falling?
Justin Hunter: Recent Trends in Building Permits in Detroit
Tauheed Islam: Hip-Hop and Rap References of the 2016 Presidential Candidates
Sonya Kalara: Analysis of Hate Crimes in New York State from 2010 - 2016 by county, year, and crime type.
Shriank Kanaparti: Housing Price analysis
Saiyaz Kazi: Tracking of Immigrant Voting Preferences in the 2016 General Election
Sara Marie Kvaska: What Influences Public Transportation Coverage?
Molly Kathryn Leavens: I built tools to break down and visualize data from a survey of Ghanian Cacao farmers
Jack Luby: Visualizing the effects of and compliance with the 2014 Gulf of Panama IMO TSS Regulations
Miranda Lupion: Visualizing crime data for Russia’s federal subjects (administrative entities) from 1990 to 2010
Keeley Rose Macafee: Is the U.S. facing a food revolution?
Sofia Marie Mascia: Bridging the Gap: An analysis of Income, Violence and Drug use in a socio-economically divided Illinois
Ethan Robert McCollister: Visual Exploration of Trends in Pitch Data from 2012 to 2018.
Robert McKenzie: NYC Taxi Pickup and Dropoff VIsualization
Michael Montella: US-China Trade
Junho Moon: Looking at key military data of different countries over time
Kodi Obika: Analyzing Ariana Grande’s songs, albums, and lyrics via song length, lexical density, and word frequency/significance
Charlie Olmert: Visualization of the Harvard Men’s Lacrosse team’s shots from the 2018 season with heat maps and filters for shot clock satisfaction.
Annabelle Paterson: New Zealand Wine Exploration: Growth of the New Zealand wine industry.
Kai Potter: Sephora Skincare Bestsellers Explorer: Making smarter, more informed purchases
Richard Qiu: Visualization of effects of Medicaid expansion on immunization rates
Noah Reimers: Analyzing NFL Player Ratings and Draft data between 2008 and 2012
Tanya Rohatgi: Excavating racial basis in false convictions, and the role of Conviction Integrity Units in overturning them, using exoneration data from 1989 to present.
Teresa Noelle Rokos: Analyzing services, patients, and expenditures at emergency rooms
Allie Russell: Analysis of offensive production in the 2017-18 NHL season
Richard Ryan: This project takes data from Fandango via FiveThirtyEight and compares movie ratings from various popular sites. You can look at the correlation between scores among these sites.
Connor Sakmar: Visualizing trends in death totals and rates from the US leading causes of death
Jack Schroeder: Visualizing the San Diego Padres’ Batter Data by Venue*
Dillon Smith: Measures of Winter Olympic success are more strongly positively correlated with voter turnout rates than measures of Summer Olympic success.
Serhiy Sokhan: Analyzing The Harvard Shop’s Web Fulfillment Data
Sydney Alexandra Steel: Gender Employment Patterns
Jordan Topoleski: What makes a top hit on the Billboard charts?
Meaghan Townsend: How do non-school factors influence literacy and social development outcomes for preschoolers?
Max George Vumbaca: A look at demographic shifts and trends in Cambridge’s homeless community across time, compared to other cities, and according to living situation
Gabriel Walker: A tool for exploring trends in Chinese elite diplomacy and overseas financial flows.
Max Weiss: Presidential and Senate Twitter Activity: Trump’s First 9 Months in Office