Level Up: Linear Regression in Python - Part 1

Article hero image

Linear regression is a machine learning technique for modeling continuous outcomes. It is used for both prediction and data analysis in a variety of different fields. It is also the basis for a number of other machine learning models, including logistic regression and poisson regression. For anyone who is interested in learning more about data science and statistics, or for anyone who wants to read and understand research papers more easily, linear regression is a great place to start!

The Codecademy Live: Linear Regression in Python series will be hosted by Sophie Sommer, a Curriculum Developer at Codecademy and creator of the Linear Regression in Python course on Codecademy. She has a masters degree in Applied Statistics from NYU and six years of classroom teaching experience, working with middle school through masters-level students.

The live series will start with a simple linear regression model and slowly build toward more complex and flexible models that can handle real-world (and messy) data. We'll mostly follow the Linear Regression in Python course, but will cover some bonus topics as time permits.

Both the course and the stream are free for anyone! We'll also be hosting 30 minutes of office hours on Thursdays at 11am EDT through at least June 3rd. During the office hours, anyone is welcome to join and ask questions about anything from the livestream or course. If you want to join those sessions, you can find more information on our Events Page. We look forward to meeting some of you in those sessions!

In the first lesson of the series, we'll be covering the basics of simple linear regression with a quantitative predictor. We'll use a small dataset to build a linear regression model that predicts weight based on height. In the process, we'll demo how to use a Jupyter notebook and introduce some common Python packages for data analysis. We'll also discuss some of the assumptions of linear regression and teach you to fit a simple model in Python.

Here are some Stack Overflow questions related to the work we did in today's session:

OLS Regression: Scikit vs. Statsmodels

How to Plot Statsmodels Linear Regression (OLS) Cleanly

If you want to ask any questions or provide feedback on the lesson, you are welcome to leave a comment on the YouTube recording of this lesson. If you’d like to watch a session live, follow the Codecademy YouTube channel.

Finally, if you want even more linear regression content, you can sign up for the Linear Regression in Python interactive course this series was based on. This course was developed by Sophie and has many more quizzes, projects, and helpful nuggets that we can’t fit into our streams!

Login with your stackoverflow.com account to take part in the discussion.