
Blog

scikit-learn

machine learning imbalanced classes

June 6, 2025 Ryan Nolan No comments yet

#Read over: Data Professor, Emma Ding, Mahesh Huddar, ritvik math

  • Part 1: Load a Dataset
  • Part 2: Simple EDA
  • Part 3: Set Up the Data
  • Part 4: Baseline Model (no fixing the imbalance)
  • Part 5: Oversampling Example 1, RandomOverSampler. To start we're going to create a simple dataframe in Python. Led to overfitting.
  • Part 6: Oversampling Method Example 2, SMOTE […]
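The idea behind random oversampling in Part 5 can be sketched in plain Python. This is a hand-rolled stand-in, not imbalanced-learn's RandomOverSampler; the helper name `random_oversample` and the toy data are hypothetical. It also shows why the technique can lead to overfitting: the added rows are exact duplicates of existing minority rows.

```python
import random
from collections import Counter


def random_oversample(rows, labels, seed=0):
    """Duplicate minority-class rows at random until every class
    has as many rows as the largest class."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        # All original rows that belong to this class
        pool = [r for r, y in zip(rows, labels) if y == cls]
        # Sample with replacement to fill the gap to the majority size
        extra = [rng.choice(pool) for _ in range(target - n)]
        out_rows.extend(extra)
        out_labels.extend([cls] * len(extra))
    return out_rows, out_labels


# Hypothetical toy data: 8 rows of class 0, 2 rows of class 1
X = [[i] for i in range(10)]
y = [0] * 8 + [1] * 2

X_res, y_res = random_oversample(X, y)
balanced_counts = Counter(y_res)
print(balanced_counts)
```

Only the training split should be resampled this way, so that the test set keeps the original class balance.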

Statistics

Simpson's paradox in Python

June 6, 2025 Ryan Nolan No comments yet

2nd example: running. 1st plot: just the mile time vs. miles per week

Statistics

two sample z test scipy

June 6, 2025 Ryan Nolan No comments yet

#Example 1: more manual, from slides. Quicker way to test it (not entirely precise): the ztest function in statsmodels.stats.weightstats does not explicitly allow for directly passing the population standard deviation. Instead, it estimates the standard error from the sample standard deviations, using a pooled variance. Example 2: marathon times of two running clubs […]
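The "more manual" route, where the population standard deviations are known and passed in directly, might look like the sketch below. It uses only the standard library (`statistics.NormalDist`) rather than SciPy, and the function name, marathon times, and sigma values are all made up for illustration.

```python
from math import sqrt
from statistics import NormalDist, mean


def two_sample_z_test(sample_a, sample_b, sigma_a, sigma_b):
    """Two-sided two-sample z test with KNOWN population standard deviations."""
    n_a, n_b = len(sample_a), len(sample_b)
    # Standard error of the difference in means
    se = sqrt(sigma_a**2 / n_a + sigma_b**2 / n_b)
    z = (mean(sample_a) - mean(sample_b)) / se
    # Two-sided p-value from the standard normal distribution
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p


# Hypothetical marathon times in minutes for two running clubs
club_a = [240, 250, 245, 238, 252, 247, 243, 249]
club_b = [255, 248, 260, 251, 257, 262, 249, 254]

# Assumed population standard deviations (not from the post)
z_stat, p_value = two_sample_z_test(club_a, club_b, sigma_a=5.0, sigma_b=5.0)
print(f"z = {z_stat:.2f}, p = {p_value:.5f}")
```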

Statistics

paired sign test in Python

June 6, 2025 Ryan Nolan No comments yet

Example 1. Example 2. #one tail #zero value #shapiro #ordinal data. Example: statsmodels
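A paired sign test is small enough to write by hand. The sketch below is a standard-library version, not the statsmodels example the post refers to; the function name and the before/after scores are hypothetical. It drops zero differences and computes an exact two-sided binomial p-value.

```python
from math import comb


def paired_sign_test(before, after):
    """Exact two-sided paired sign test; pairs with zero difference are dropped."""
    diffs = [a - b for b, a in zip(before, after)]
    pos = sum(d > 0 for d in diffs)
    neg = sum(d < 0 for d in diffs)
    n = pos + neg          # zero differences carry no sign information
    k = min(pos, neg)
    # P(X <= k) for X ~ Binomial(n, 0.5), doubled for a two-sided test
    tail = sum(comb(n, i) for i in range(k + 1)) / 2**n
    return pos, neg, min(1.0, 2 * tail)


# Hypothetical ordinal scores for the same 10 people before and after
before = [5, 6, 7, 5, 8, 6, 7, 6, 5, 7]
after = [7, 8, 7, 6, 9, 8, 6, 8, 7, 9]

n_pos, n_neg, p_value = paired_sign_test(before, after)
print(n_pos, n_neg, p_value)
```

For a one-tailed test, use the single tail probability in the hypothesized direction instead of doubling it.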

scikit-learn

Column Transformer

June 6, 2025 Ryan Nolan No comments yet

#drop #Example: pass through some columns, drop others
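A minimal sketch of passing some columns through and dropping others, assuming a small made-up NumPy array and integer column indices (the post's own data is not shown in the excerpt):

```python
import numpy as np
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import StandardScaler

# Hypothetical data with three numeric columns
X = np.array([
    [1.0, 10.0, 100.0],
    [2.0, 20.0, 200.0],
    [3.0, 30.0, 300.0],
])

ct = ColumnTransformer(
    transformers=[
        ("scale", StandardScaler(), [0]),   # transform column 0
        ("keep", "passthrough", [1]),       # leave column 1 untouched
        ("discard", "drop", [2]),           # remove column 2 from the output
    ]
)

X_out = ct.fit_transform(X)
print(X_out.shape)
```

Columns not listed in any transformer are handled by the `remainder` parameter, which defaults to `"drop"`.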

scikit-learn

extra trees classifier

June 6, 2025 Ryan Nolan No comments yet

Aggregates the results from a group of decision trees (like a random forest). Differences: 1. ETC randomly selects the value used to split a feature, unlike a DTC, which looks for the best split. 2. This makes ETC more random and a faster algorithm, which can help with noisy data.
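A minimal sketch of fitting the classifier, assuming a synthetic dataset from `make_classification` (the post's dataset is not shown here, and the parameter values are illustrative only):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import ExtraTreesClassifier
from sklearn.model_selection import train_test_split

# Synthetic stand-in data, not the dataset from the post
X, y = make_classification(
    n_samples=300, n_features=10, n_informative=5, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Each tree draws random split thresholds instead of searching for the best one
etc = ExtraTreesClassifier(n_estimators=100, random_state=0)
etc.fit(X_train, y_train)

accuracy = etc.score(X_test, y_test)
print(f"Test accuracy: {accuracy:.3f}")
```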

Linear Algebra

Python Reduced Row Echelon Form

June 6, 2025 Ryan Nolan No comments yet

From REF to RREF
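Going from row echelon form to reduced row echelon form means scaling each pivot to 1 and clearing the entries above it. The sketch below is a hand-rolled version using exact fractions, not necessarily the method used in the post; the function name and the example matrix are hypothetical.

```python
from fractions import Fraction


def rref(matrix):
    """Return the reduced row echelon form using exact rational arithmetic."""
    m = [[Fraction(x) for x in row] for row in matrix]
    rows, cols = len(m), len(m[0])
    pivot_row = 0
    for col in range(cols):
        if pivot_row == rows:
            break
        # Find a row at or below pivot_row with a nonzero entry in this column
        pivot = next((r for r in range(pivot_row, rows) if m[r][col] != 0), None)
        if pivot is None:
            continue
        m[pivot_row], m[pivot] = m[pivot], m[pivot_row]
        # Scale the pivot row so the pivot becomes 1
        pv = m[pivot_row][col]
        m[pivot_row] = [x / pv for x in m[pivot_row]]
        # Clear every other entry in the pivot column (above and below)
        for r in range(rows):
            if r != pivot_row and m[r][col] != 0:
                factor = m[r][col]
                m[r] = [a - factor * b for a, b in zip(m[r], m[pivot_row])]
        pivot_row += 1
    return m


# Hypothetical augmented matrix that is already in REF
ref = [[1, 2, 3],
       [0, 1, 4]]
result = rref(ref)
print([[str(x) for x in row] for row in result])
```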

scikit-learn

Lasso Regression

June 6, 2025 Ryan Nolan No comments yet

https://youtu.be/LmpBt0tenJE

  • LASSO stands for Least Absolute Shrinkage and Selection Operator
  • L1 regularization
  • Addresses overfitting: a model that is too complex may fit the training data very well but perform poorly on new, unseen data
  • Will get rid of useless features (shrinks the coefficients of those independent variables to 0)
  • Automatic feature selection
  • Leads to a simpler model that […]
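The feature-selection effect can be seen on synthetic data. In this sketch, which assumes made-up data and an arbitrary `alpha=0.1`, only the first two of five features influence the target, and the L1 penalty shrinks the remaining coefficients toward zero.

```python
import numpy as np
from sklearn.linear_model import Lasso

# Synthetic data: only features 0 and 1 actually drive the target
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = 3.0 * X[:, 0] - 2.0 * X[:, 1] + rng.normal(scale=0.1, size=200)

# alpha sets the strength of the L1 penalty (value chosen for illustration)
model = Lasso(alpha=0.1)
model.fit(X, y)

coefs = model.coef_
print(np.round(coefs, 3))
```

A larger `alpha` removes more features; in practice it is usually chosen by cross-validation, for example with `LassoCV`.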

Linear Algebra

Python Augmented Matrix

June 5, 2025 Ryan Nolan No comments yet

Example 1. Example 2: row operations. Multiply the first row by 3; add the 2nd row to the first; subtract the 2nd row from the 3rd.
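The three row operations listed above can be sketched with NumPy. The coefficient matrix and right-hand side here are made up; only the operations themselves come from the excerpt.

```python
import numpy as np

# Hypothetical coefficient matrix A and right-hand side b
A = np.array([[1.0, 2.0],
              [3.0, 4.0],
              [5.0, 6.0]])
b = np.array([[7.0], [8.0], [9.0]])

# Build the augmented matrix [A | b]
aug = np.hstack([A, b])

aug[0] = 3 * aug[0]        # multiply the first row by 3
aug[0] = aug[0] + aug[1]   # add the 2nd row to the first
aug[2] = aug[2] - aug[1]   # subtract the 2nd row from the 3rd

print(aug)
```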

Web Scraping

web scraping with python

June 5, 2025 Ryan Nolan No comments yet

web scraping with python

Part 1: Getting your first page

    import requests
    from urllib.parse import urljoin
    import urllib.robotparser

    def response_code(response):
        # Report whether the page was fetched successfully
        if response.status_code == 200:
            print("Page fetched successfully!")
        else:
            print("Failed to retrieve page:", response.status_code)

    URL = "http://books.toscrape.com/"
    url_response = requests.get(URL)
    response_code(url_response)

Failed to retrieve page: 403 | a client does not have the necessary permissions to […]
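The `urljoin` and `urllib.robotparser` imports in the excerpt suggest the post also builds absolute URLs and checks robots.txt, although that part is cut off. A minimal offline sketch of both, assuming made-up robots.txt rules and paths (these are not the site's actual rules):

```python
from urllib.parse import urljoin
import urllib.robotparser

BASE_URL = "http://books.toscrape.com/"

# Hypothetical robots.txt content, parsed from memory so no request is made
robots_lines = [
    "User-agent: *",
    "Disallow: /private/",
]
parser = urllib.robotparser.RobotFileParser()
parser.parse(robots_lines)

# Turn relative links into absolute URLs (paths are illustrative)
catalogue_url = urljoin(BASE_URL, "catalogue/page-2.html")
private_url = urljoin(BASE_URL, "private/report.html")

can_fetch_catalogue = parser.can_fetch("*", catalogue_url)
can_fetch_private = parser.can_fetch("*", private_url)
print(catalogue_url, can_fetch_catalogue)
print(private_url, can_fetch_private)
```

Against a live site, `parser.set_url(urljoin(BASE_URL, "robots.txt"))` followed by `parser.read()` would load the real rules instead.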


Helping data professionals further their careers

© Ryan & Matt Data Science
