Scikit-Learn House Price AI
28 Nov 2022Process:
Use Pandas to read a csv into a DataFrame. Enumerate the data to get a frame with only numbers. Check for unusable data and use imputation, if needed, to insert data. After inspecting graphs of the data, normalize the data and filter it accordingly. Split the data into training and test sets, and then model the data using KNeighborsRegressor model and train the data. Then, we predict on the test set and measure the error. Finally, we fiddle with the model a bit to find the most accurate one, and then we’re done.
Snippets:
Information Gain on Parameters (VarianceThreshold Not Pictured):
The r2 Values:
The Final Model:
See the full project on Github