View My GitHub Profile

My GA Alumni Profile

Connect on LinkedIn

Guang Yi has the ability to identify any musical note without the benefit of a reference tone - also known as perfect pitch or absolute pitch.

When he discovered that not everyone perceives the world the same way as him, his interest to find out why this happened grew, which led him to neuroscience research, and eventually to data science.

This GitHub page shows a sample of some projects Guang Yi has worked on.

Guang Yi’s GitHub Page


General Assembly’s Capstone Project

With my background in neuroscience and classical music, I latched on to the subset of people that are interested in machine-generated music. The most important questions I wonder about are:

  1. What purpose does music serve humanity, if any?
  2. Is music inherently human?
    • Can a machine learn to create music?
    • Can the music a machine composes evoke the same emotion that human music can?
    • Can a machine learn to compose like the musical giants of the past and present?
      • Examples: Adele, Bach, Beethoven, Chopin, Coldplay, Gershwin, Taylor Swift

Topics covered: Machine learning, deep learning, neural networks, sequence-based predictions


General Assembly’s Project 3

You work for a real estate company interested in using data science to determine the best properties to buy and re-sell. Specifically, your company would like to identify the characteristics of residential houses that estimate the sale price and the cost-effectiveness of doing renovations.

Topics covered: Linear regression, logistic regression, L1/L2 penalties (ridge/lasso)


General Assembly’s Project 2

Your hometown mayor just created a new data analysis team to give policy advice, and the administration recruited you via LinkedIn to join it. Unfortunately, due to budget constraints, for now the “team” is just you…

The mayor wants to start a new initiative to move the needle on one of two separate issues: high school education outcomes, or drug abuse in the community.

Also unfortunately, that is the entirety of what you’ve been told. And the mayor just went on a lobbyist-funded fact-finding trip in the Bahamas. In the meantime, you got your hands on two national datasets: one on SAT scores by state, and one on drug use by age. Start exploring these to look for useful patterns and possible hypotheses!

Topics covered: Exploratory data analysis, data cleaning, plotting


General Assembly’s Project 1

You are an analyst at a “scrappy” online gaming company that specializes in remakes of last year’s fads.

Your boss, who runs the product development team, is convinced that Pokemon Go’s fatal flaw was that you had to actually move around outside. She has design mock-ups for a new game called Pokemon Stay: in this version players still need to move, but just from website to website. Pokemon gyms are now popular online destinations, and catching Pokemon in the “wild” simply requires browsing the internet for hours in the comfort of your home.

She wants you to program a prototype version of the game, and analyze the planned content to help the team calibrate the design.

Topics covered: Basic knowledge of statistics and python.