Project Check-in 1
Brainstorming ☑️
In your term project for this class, you will apply everything we'll learn about processing, analyzing, and displaying data in Python using libraries and custom algorithms. As a former student once put it, this project is about only two things: data integrity and critical thinking.
To do this, you will (later in the semester) collect an amount of data too large to process by hand, such as a collection of books from Project Gutenberg, data collected by statistical reseachers from ICPSR, data you've manually collected in a spreadsheet over a longish period of time, or...well, anything. I'll give plenty of examples of places you can get data in the next Check-in, but you really should pick something that interests you and search around from there.
The point is that you explore an open-ended question related to your interest, you offer an answer grounded in data but well explained beyond just the numbers alone, and you use Python subtantially to help you along the way.
You are free to work on a project on any topic that is of interest to you, your friends, your family, or so on. After deciding on your data, you will use Python to scrape, clean, analyze, interact with, and/or visualize that data. Using those results, you will interpret the data that you collected. The end goal will be to arrive systematically at a meaningful interpretation of data that was too large to process by hand.
First Check-in
For our first check-in, you will brainstorm initial ideas for what data you want to look at and what questions you might ask of it. For example:
- I might collect all the books written by the Bronte sisters from Project Gutenberg and ask, how do the sisters' writing styles compare?
- A previous student collected over 1000 comments from a subreddit and asked, how do people describe the taste of bourbon?
- Another previous student wrote a web scraper that scraped stat numbers about routers from a certain website, then compared these stats
- Another previous student collected data about what features of bonsai trees were associated with them winning awards
- I've had students do projects on legos, transformer figures, their spouses' businesses, the stock market, the 1918 influenza pandemic, tweets using the #QueerDnD hashtag, reddit comments about when D&D changed from 4th to 5th edition, tweets about a recent legal case ...
- And Data is Beautiful and Bloomberg both have a wealth of examples
So, come up with a list of 3–10 data sources of some interest to you or your friends/family members. For each, say why it interests you and say a few ideas of what open-ended questions you might ask of it. (If you list fewer data sources, I expect you to write more about each one. If you list more data sources, then you can write less about each.)
The more precise you can be the better. Instead of saying, "I'll collect data from the internet," try to say the specific site or give at least one specific example.
Once you're done jotting down your brainstorming ideas, post it to the discussion board for this Check-in. In the next Check-in you'll comment on each others' ideas.
Submission
Post your answers/materials for the check-in above to Blackboard to the discussion board for this assignment.
Grading
The entire term project process, from this first check-in all the way up to the final draft, is worth 31/100 towards your final grade.
This first check-in is only worth 1/100, so it is small, there are no wrong answers, and it will set the stage for you to do well on all those other points.