https://youtu.be/GMF4Td7KtB0#Ridge Regression which is considered #L2 Regularization #helps with overfitting in linear regression models #keeping the coefficients small # lead to a model that is less prone to overfitting #balance between fitting the data and keeping the coefficients small #more robust and stable models, particularly when dealing with datasets that have highly correlated predictor variables […]
Stacking Regressor
SEE ALL NULL VLAUES voting classifier hyperparamater tuning
BeautifulSoup4 extract table
import requests import pandas as pd from bs4 import BeautifulSoup Basic Example HTML Code -> Runners html = “”” Personal Running Bests Personal Running Bests Distance Time 5k 18:30 10k 37:50 Half Marathon 1:25:11 Marathon 3:17:00 50 Miler 9:14:30 100 Miler 32:11:11 “”” Extract headers headers = [th.get_text(strip=True) for th in table.find_all(“th”)] headers Step 5: […]
Augmented Dickey–Fuller test
parts of the ADF test – tuple access by index
KPSS-test
parts of the ADF test -tuple access by index
Multicollinearity
dividing the total number of bases a player records by their total number of at-batsmaybe replace this with something else? CORRELATION MATRIX VIF Instead of using raw height, you might normalize or categorize height into bins, which could reduce the numerical interdependence.Calculate Condition Index (CI) How to address MulticollinearityDrop a Feature (At Bats) look at […]
Pandas Sample
We are going to be looking at Pandas Sample(). The sample() method returns a specified number of random rows. it also returns one row if a number is not specified https://youtu.be/REhRhRUcluI Example 1 – if else state location To start with, we are going to be importing various libraries. pandas as pd random string numpy […]
Pandas MultiIndex
We are going to be lloking at Pandas Multiindex https://youtu.be/O6Lv9nyN0i4?si=YMIZleZU2B-5zjcy to start with, we would import pandas as pd import pandas as pd Example 1 In this example, we create a Python dictionary “data_races, that stores race information, including race names, the year they occured, and a difficulty rating for each event data_races = { […]