The goal of the linear equation is to end up with the line that best fits the data. There are usually multiple independent variables, useful for analyzing complex questions with “either-or” construction. It is used when the dependent variable has two categorical options, which must be mutually exclusive. You may also hear the term “logistic regression.” It’s another type of machine learning algorithm used for binary classification problems using a dataset that’s presented in a linear format. The goal is to create a line that has as few errors as possible. The distance between a point on the graph and the regression line is known as the prediction error. The equation creates a line, hence the term linear, that best fits the X and Y variables provided. The result should be a linear regression equation that can predict future students’ results based on the hours they study. The data scientist trains the algorithm by refining its parameters until it delivers results that correspond to the known dataset. The student inputs a portion of a set of known results as training data. It’s used for finding the relationship between the two variables and predicting future results based on past relationships.įor example, a data science student could build a model to predict the grades earned in a class based on the hours that individual students study. Linear regression is a supervised learning algorithm that compares input (X) and output (Y) variables based on labeled data.
Machine Learning Algorithms expand_more.Statistics Concepts for Data Scientists expand_more.What Can You Do With a Computer Science Degree?.How to Become a Business Analyst With No Experience.Best Master’s in Information Systems expand_more.
Master’s in Data Science Programs in California.Best Master’s in Data Science expand_more.