I shifted the project focus since last time. Namely, I’m focusing on language shift, and upon realizing that there is no PUMA-level data for the decennial census, reduced the data to only use the American Community Survey. Today, I mainly worked on filtering the ACS data through some Rust code (example JSON output is shown below).
The code splits up the raw data (which includes all surveyed participants) into chunks split up by location and generation for each language being studied (currently the 5 most spoken languages in California). I then am able to calculate aggregated statistics like the immigration rate for each year within a generation, total language speakers per year, and the linguistic diversity of a given region along with the linguistic diversity of each constituent PUMA (as shown below).