Skip to content
  • Blog
  • YouTube
  • Discord
  • Services
    • Streamlit Developer for Hire
    • Hire n8n automation engineer
Data Freelancing
Mentorships

Blog

  • Home
  • Blog
  • Page 2
Statistics

python quantiles statistics

July 21, 2025 Ryan Nolan No comments yet

In Python, a quantile is a statistical term used to describe a point or value below which a certain proportion of the data falls. It means a quntile split data into intervals. We start by importing numpy and pandas. numpy is used for high-performance numerical computation. Pandas is used for data manipulation, data analysis and […]

Python Pandas

Python Pandas Data Cleaning

July 19, 2025 Ryan Nolan No comments yet

https://www.espncricinfo.com/records/highest-career-batting-average-282910 Here, we read the CSV file names ‘CricketTestMatchData.csv’ into a DataFrame called df using the read_csv. Here, we check for missing null values in the DataFrame df. It returns a Boolean result for each column. It returns True if the colun has  any missing values and False if it doesn’t. This line filters the […]

Python Pandas

Pandas Columns

July 19, 2025 Ryan Nolan No comments yet

Pandas Dataframes are composed of Rows and Columns. In this guide we are going to cover everything you need to know about working with columns. The article is based on a tutorial we published on our YouTube channel. Feel free to check it out below. Let’s start with importing in Pandas and NumPy. Here we […]

Python Pandas

Pandas Resample

July 19, 2025 Ryan Nolan No comments yet

The .resample() method in pandas works similarly to .groupby(), but it is specifically designed for time-series data. It groups data into defined time intervals and then applies one or more functions to each group. This method is useful for both upsampling—where missing data points can be filled or interpolated—and downsampling, which involves aggregating data over […]

Python Pandas

Python Pandas JSON

July 19, 2025 Ryan Nolan 1 comment

JSON (JavaScript Object Notation) is a lightweight, human-readable data interchange format that is widely used for both data storage and transfer. It is structured using key-value pairs and supports various data types, including strings, numbers, booleans, arrays, and nested objects. JSON is a standard format commonly used in APIs and web data, which makes it […]

Web Scraping

beautifulsoup pagination

July 19, 2025 Ryan Nolan No comments yet

import requests – Allows us to make HTTP requests to web pages. from bs4 import BeautifulSoup –It is used to parse and extract data from HTML content. import pandas as pd – It is used for organizing and manipulating data in table format. import re – It enables pattern matching using regular expressions. from time […]

scikit-learn

adaboost classifier

July 12, 2025 Ryan Nolan No comments yet

Adaptive Boosting, or AdaBoost, is a boosting algorithm that combines multiple low-accuracy (weak) models to form a single high-accuracy (strong) model. It works by sequentially training these weak learners, each one focusing more on the errors made by the previous ones. Any machine learning algorithm that supports weighted training samples—such as Decision Trees, Logistic Regression, […]

Python

Gradient boosting classifier

July 12, 2025 Ryan Nolan No comments yet

Gradient Boosting is an ensemble technique that builds a strong model by combining multiple weak decision trees. While it may seem similar to a Random Forest, there’s a key difference: in Random Forests, each tree is built independently, whereas in Gradient Boosting, trees are built sequentially, with each new tree correcting the errors of the […]

scikit-learn

Kaggle House price prediction Regression Analysis

July 12, 2025 Ryan Nolan No comments yet

train_df = train_df.drop(columns=[‘PoolQC’, ‘MiscFeature’, ‘Alley’, ‘Fence’, ‘GarageYrBlt’, ‘GarageCond’, ‘BsmtFinType2’]) test_df = test_df.drop(columns=[‘PoolQC’, ‘MiscFeature’, ‘Alley’, ‘Fence’, ‘GarageYrBlt’, ‘GarageCond’, ‘BsmtFinType2’]) #drop GarageArea or GarageCars #build models

scikit-learn

kaggle titanic tutorial

July 12, 2025 Ryan Nolan No comments yet

#military – Capt, Col, Major #noble – Jonkheer, the Countess, Don, Lady, Sir #unmaried Female – Mlle, Ms, Mme #NEW Drop Sibsp, Parch, TicketNumberCounts #OLD #X = train_df.drop([‘Survived’], axis=1) #y = train_df[‘Survived’] #X_test = test_df.drop([‘Age_Cut’, ‘Fare_Cut’], axis=1)

Posts pagination

Previous 1 2 3 … 16 Next

Search

Categories

  • LangChain (3)
  • LeetCode (8)
  • Linear Algebra (7)
  • N8N (9)
  • Python (28)
  • Python Pandas (36)
  • scikit-learn (29)
  • Statistics (24)
  • Time Series (6)
  • Uncategorized (2)
  • Web Scraping (5)

Recent posts

  • n8n openai api
  • N8N Perplexity
  • Streamlit Full Course

Helping Data Professions further there careers

Important Links
  • Blog
  • Sponsorships
  • Mentorships
  • Data Freelancing
LinkedIn
  • Ryan Nolan
  • Matt Payne
Get in touch
  • ryannolandata@gmail.com

© Ryan & Matt Data Science

  • Terms & Conditions
  • Privacy Policy