https://youtu.be/GMF4Td7KtB0#Ridge Regression which is considered #L2 Regularization #helps with overfitting in linear regression models #keeping the coefficients small # lead to a model that is less prone to overfitting #balance between fitting the data and keeping the coefficients small #more robust and stable models, particularly when dealing with datasets that have highly correlated predictor variables […]
Pandas datetime
In this Python Pandas lesson, we will take a look how Datetime works. If you would rather watch a YouTube video then read the article, the video the article is based around is linked below. https://youtu.be/1tmN3gbimvI?si=_fvXRgfzoFlm7deM Importing Required Libraries we start by importing three libraries: pandas as pd: for data manipulation and analysis numpy as […]
Stacking Regressor
SEE ALL NULL VLAUES voting classifier hyperparamater tuning
BeautifulSoup4 extract table
import requests import pandas as pd from bs4 import BeautifulSoup Basic Example HTML Code -> Runners html = “”” Personal Running Bests Personal Running Bests Distance Time 5k 18:30 10k 37:50 Half Marathon 1:25:11 Marathon 3:17:00 50 Miler 9:14:30 100 Miler 32:11:11 “”” Extract headers headers = [th.get_text(strip=True) for th in table.find_all(“th”)] headers Step 5: […]
Augmented Dickey–Fuller test
parts of the ADF test – tuple access by index
KPSS-test
parts of the ADF test -tuple access by index
Multicollinearity
dividing the total number of bases a player records by their total number of at-batsmaybe replace this with something else? CORRELATION MATRIX VIF Instead of using raw height, you might normalize or categorize height into bins, which could reduce the numerical interdependence.Calculate Condition Index (CI) How to address MulticollinearityDrop a Feature (At Bats) look at […]
Pandas Sample
We are going to be looking at Pandas Sample(). The sample() method returns a specified number of random rows. it also returns one row if a number is not specified https://youtu.be/REhRhRUcluI Example 1 – if else state location To start with, we are going to be importing various libraries. pandas as pd random string numpy […]
Pandas MultiIndex
Working with structured data in Python often calls for more than just a flat table. When your dataset has multiple levels of information—like years and quarters, countries and cities, or products and categories—Pandas MultiIndex can be a powerful tool. It allows you to represent hierarchical relationships within your data, enabling advanced analysis, cleaner code, and […]