Scikit-Learn House Price AI

28 Nov 2022

Goals: Get accustomed to Jupyter Notebooks, Scikit-Learn, and simple regression AI modeling. Learn concepts such as normalization, imputation, enumeration, the foundations of CRISP-DM, and the basics of AI modeling.

Results: Model with an r² of 0.9999968, but is not general to any house due to location data being factored into the model.

Process:

Use Pandas to read a csv into a DataFrame. Enumerate the data to get a frame with only numbers. Check for unusable data and use imputation, if needed, to insert data. After inspecting graphs of the data, normalize the data and filter it accordingly. Split the data into training and test sets, and then model the data using KNeighborsRegressor model and train the data. Then, we predict on the test set and measure the error. Finally, we fiddle with the model a bit to find the most accurate one, and then we’re done.

Snippets:

Information Gain on Parameters (VarianceThreshold Not Pictured):

The r² Values: R-Squared Values

The Final Model:

See the full project on Github

Neil Ghugare

Scikit-Learn House Price AI

Related Posts

AI Docking Port Locator and Distance Regressor for the ISS 23 Feb 2025

Book Recommendation System Using Goodreads Dataset 12 Dec 2024

Dementia Classification AI 28 Sep 2024