Data Analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making. 1

Select your tag(s) below:

  • StyleGAN - AI image generation

    StyleGAN - AI image generation

    See that picture of a person? AI did that, and that person doesn’t exist.

  • What Great Data Analysts Do

    What Great Data Analysts Do

    Different types of data people: Machine Learning, Statistics, and Analytics

  • Generating datasets to learn Data Science

    Generating datasets to learn Data Science

    A brief rundown of methods/packages/ideas to generate synthetic data for self-driven data science projects and deep diving into machine learning methods.

  • Towards Data Science

    Towards Data Science

    Quickly becoming a regular check-in for ideas and methods for Data Science.

  • Pulling Google Analytics into a Jupyter Notebook

    Pulling Google Analytics into a Jupyter Notebook

    Chapter 6 of a series from Humanlytics complete with code examples and more.

  • Pandas Python Data Analysis Library

    Pandas Python Data Analysis Library

    If you are manipulating large datasets in excel and are tired of dealing with long wait times, check this out.

  • The Jupyter Notebook

    The Jupyter Notebook

    Killer environment to help you learn and test python, plus much more.

  • Mike Bostock

    Mike Bostock

    Creator of D3 and Observable. Former data scientist at the New York Times.

  • lantrns.co's experience with DBT

    lantrns.co's experience with DBT

    A roundabout tutorial to a SQL tool called the Data Build Tool (dbt).

  • D3 Official Tutorials

    D3 Official Tutorials

    You know those cool New York Times charts you see from time to time? They probably use this.

  • Observable Data Notebooks

    Observable Data Notebooks

    A collaborative platform for data science built by the founder of D3.

  • Congressional Districts Geo-helpers

    Congressional Districts Geo-helpers

    A poorly structured collection of relationship files helpful with matching back zip codes, counties, etc, to congressional disctricts.

  • SQL on Codecademy

    SQL on Codecademy

    Finished it. Pretty decent. Took notes.

  • SQL on Codewars

    SQL on Codewars

    Going to get to this after Codecademy.

  • Mode Analytics

    Mode Analytics

    Kind of want to check this out, but when I first looked, the docs were limited.

  • Astronomer - The Apache Airflow Starter

    Astronomer - The Apache Airflow Starter

    I honestly have no idea what this does yet, but I want to look into it. Looks like some kind of workflow management tool.

  • Coding Is For Losers (YouTube)

    Coding Is For Losers (YouTube)

    Supercharge your data pipeline using the Coding Is For Losers stack. From data collection to visualization, plus some extra tactics.

  • Singer: open-source ETL by the Stitch folks.

    Singer: open-source ETL by the Stitch folks.

    Another ETL with the option to create your own connectors; think of an ETL as Zapier or IFTTT but for data.

  • Stitch: extensible ETL for data management.

    Stitch: extensible ETL for data management.

    Just getting started exploring this one, but I can already see a world of possibilities.

  • LinkedIn: The Economic Graph

    LinkedIn: The Economic Graph

    This is LinkedIn’s Data on the Work Force. It includes where workers are leaving and where workers are going.

  • Bureau of Labor Statistics

    Bureau of Labor Statistics

    Link to the U.S. Bureau of Labor Statistics data section.

  • Wil Reynold's Slideshare

    Wil Reynold's Slideshare

    The Founder of Seer Interactive’s slideshare. Includes 50+ decks most recently on data visualization.

  • U.S. Census: Directory Structure

    U.S. Census: Directory Structure

    U.S. Census data in directory structure.

  • IRS Individual Income Tax Stats

    IRS Individual Income Tax Stats

    Directory of IRS individual income tax data by zip code and year.

  • Tell a story that translates Google Ads reports into action

    Tell a story that translates Google Ads reports into action

    If you’re new to reporting, check out this article and short video for some ideas to kickstart your dashboards.

  • reddit.com/r/dataisbeautiful

    reddit.com/r/dataisbeautiful

    I check this subreddit periodically for new ideas on data visualization. You’ll find the work of some very talented data scientists here.

  • Visualizing Clusters of Clickbait Headlines

    Visualizing Clusters of Clickbait Headlines

    Step by step guide to a 100,000+ headline analysis of clickbait using Apache Spark and Word2vec by Max Woolf.

  • Data.World

    Data.World

    A huge, community driven library of datasets with connectors to Power BI and Google Data Studio.

  • CityLab Places Blog

    CityLab Places Blog

    I need to do a better job keeping up with this one. Anyone dealing with Places or Planning should check this out.

  • Zillow Research Data

    Zillow Research Data

    Zillow’s collection of data on everything from zestimates to home sale counts and prices.

  • CityObservatory Blog

    CityObservatory Blog

    Fantastic data blog on cities.