
Blog

scikit-learn

Simple Imputer

July 3, 2025 Ryan Nolan

When working with data in Python, especially using pandas, handling missing values is a crucial step in data cleaning. Missing values can occur in both categorical and numeric columns. There are several common strategies to address them: you can choose to ignore them (though this is rarely recommended), remove the rows that contain them using […]

scikit-learn

Logistic Regression

July 3, 2025 Ryan Nolan

Logistic regression is a statistical model used for binary classification problems, where the goal is to predict one of two possible outcomes. Unlike linear regression, which predicts continuous values, logistic regression estimates the probability that a given input belongs to a particular class. It uses the logistic (sigmoid) function to map predicted values between 0 […]
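As a rough illustration of the sigmoid mapping described in this excerpt, here is a minimal sketch in plain Python. It is not scikit-learn's implementation; the helper names `sigmoid` and `predict_proba` and the weights and feature values are made up for the example.

```python
import math


def sigmoid(z: float) -> float:
    """Map a real number to the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(features, weights, bias):
    """Probability of the positive class for one sample (hypothetical helper)."""
    # Linear combination of the inputs, then squashed by the sigmoid.
    z = sum(w * x for w, x in zip(weights, features)) + bias
    return sigmoid(z)


# Illustrative numbers only: 0.5 * 2.0 + (-1.0) * 1.0 + 0.0 = 0.0
p = predict_proba([2.0, 1.0], weights=[0.5, -1.0], bias=0.0)

# A common convention is to predict class 1 when the probability is at least 0.5.
label = 1 if p >= 0.5 else 0
```

A linear score of exactly zero lands on the midpoint of the curve, which is why 0.5 is the usual decision threshold.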

scikit-learn

Decision Tree

June 27, 2025 Ryan Nolan

A decision tree is a non-parametric supervised learning algorithm used for both classification and regression tasks. It has a hierarchical tree structure consisting of a root node, branches, internal nodes, and leaf nodes. Note: Parametric supervised learning refers to a type of machine learning where the model assumes a specific functional form and estimates […]

scikit-learn

Voting Classifier

June 25, 2025 Ryan Nolan

Boosting Accuracy with Voting Classifiers: In machine learning, combining multiple models often leads to better performance than relying on a single one. A Voting Classifier is a simple ensemble method that does just that: it aggregates predictions from several models to improve accuracy. There are two types: Hard Voting: Takes the majority vote from […]
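The majority-vote idea behind hard voting can be sketched in a few lines of plain Python. This is only an illustration of the concept, not how scikit-learn's `VotingClassifier` is implemented; the function name `hard_vote` and the labels are hypothetical.

```python
from collections import Counter


def hard_vote(predictions):
    """Return the label predicted by the most models (hypothetical helper)."""
    # most_common(1) gives a list holding the single (label, count) pair
    # with the highest count.
    return Counter(predictions).most_common(1)[0][0]


# Pretend three different models each predicted a label for the same sample.
model_predictions = ["spam", "ham", "spam"]
final_label = hard_vote(model_predictions)
```

Two of the three models agree here, so their label wins. Using an odd number of models is a simple way to avoid ties in binary problems.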

scikit-learn

Elastic Net Regressor

June 25, 2025 Ryan Nolan

Elastic Net regression is a linear regression method that merges the strengths of both Lasso (L1) and Ridge (L2) regression techniques. It helps reduce overfitting and is especially effective when working with datasets that have many features, particularly when some of those features are highly correlated. The model’s regularization is controlled by two key hyperparameters: […]

scikit-learn

Gradient Boosting Regressor

June 24, 2025 Pere

Boosting in machine learning is a technique that combines multiple simple models, often decision trees, into a single, stronger model. It works with regression trees and improves performance by sequentially learning from the mistakes of previous models. According to the scikit-learn documentation, at each stage, a regression tree is fit on the negative gradient of […]

Python

Python For Loop

June 24, 2025 Ryan Nolan

Loops are used to repeat a block of code multiple times. A “for” loop in Python is used when the number of repetitions is known. A “while” loop is used when the number of repetitions is not known in advance, or can be infinite, but there has to be a condition for stopping or exiting out the […]
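To make the contrast concrete, here is a small sketch with made-up values. The "for" loop runs once per item in a list of known length, while the "while" loop keeps going until its condition becomes false.

```python
# "for": the number of repetitions is fixed by the length of the list.
total = 0
for number in [1, 2, 3, 4]:
    total += number

# "while": repeat until the stopping condition is met.
countdown = 3
steps = []
while countdown > 0:
    steps.append(countdown)
    countdown -= 1  # without this update the loop would never end
```

Forgetting to update the variable checked in the "while" condition is the usual cause of an accidental infinite loop.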

scikit-learn

Random Forest Regressor

June 24, 2025 Ryan Nolan

Random forest regressor is a variant of the random forest classifier. It is primarily used for regression tasks. This model is an ensemble of decision trees. It combines the predictions of multiple individual trees to improve performance. By aggregating the results from those trees, typically through averaging (or voting, in the classifier variant), it produces a final prediction that […]
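The aggregation step for regression can be pictured as a plain average of the individual tree outputs. The sketch below uses invented numbers in place of real tree predictions and skips everything else a random forest does, such as bootstrapping and feature sampling.

```python
from statistics import mean

# Hypothetical predictions from three individual trees for the same sample.
tree_predictions = [210.0, 195.0, 204.0]

# For regression, the ensemble's prediction is the average of the tree outputs.
forest_prediction = mean(tree_predictions)
```

Averaging smooths out the errors of individual trees, which is where the improvement over a single tree comes from.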

Python

Python Dictionaries

June 22, 2025 Ryan Nolan

A Python dictionary is a built-in data type used to store key-value pairs. It is similar to a real dictionary, where you look up a word (key) to find its definition (value). In Python, a dictionary stores data in linked key-value pairs. Each key is connected to a specific value, making it […]
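Following the word-and-definition analogy, a short sketch might look like this. The `glossary` contents are made up for illustration.

```python
# Each key (a word) is linked to a value (its definition).
glossary = {
    "loop": "a block of code that repeats",
    "key": "the name used to look up a value",
}

# Look up a value by its key.
definition = glossary["loop"]

# Add a new key-value pair.
glossary["value"] = "the data stored under a key"

# .get() returns a fallback instead of raising KeyError for a missing key.
missing = glossary.get("tuple", "not defined")
```

Using square brackets with a key that does not exist raises a `KeyError`, so `.get()` is the safer choice when a key may be absent.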

N8N

Connect Google Drive to N8N

June 21, 2025 Ryan Nolan

Setting up your Google Drive connection to N8N is one of the first things you need to do when starting to explore the software. Google Drive is used a ton within workflows and can really expand what you are able to build. In this lesson, I’ll take you step by step through getting this ready […]



Helping data professionals further their careers


© Ryan & Matt Data Science
