You may have heard this saying:“All data scientists are data analysts, but not all data analysts are data scientists.” By the end of this article, this statement should make perfect sense https://www.youtube.com/watch?v=EEVScxGDlFk Having worked as both a data anaylst and a data scientist, i have seen firsthand how the roles overlap and where they diverge. […]
FAISS LangChain
FAISS (Facebook AI Similarity Search) is a vector library developed by Facebook that is used to store and search embeddings efficiently. It is particularly useful for tasks like question answering within documents, where you need to retrieve relevant parts of the content based on semantic similarity. By converting text into embeddings, FAISS allows you to […]
gradient boosting regressor
Boosting in machine learning is a technique that combines multiple simple models, often decision trees into a single, stronger model. It works with regression trees and improves performance by sequentially learning from the mistakes of previous models. According to the scikit-learn documentation, at each stage, a regression tree is fit on the negative gradient of […]