**Tag:**data science

**Total:**60 Posts

Posts of Tag: data science

## Get Feature Importances for Random Forest with Python and Scikit-Learn

.lazyload-placeholder { display: none; } Introduction The Random Forest algorithm is a tree-based supervised learning algorithm that uses an ensemble of predicitions of many decision trees, either to classify a...Learn MorePythonMachine Learningscikit-learndata sciencematplotlibseaborn## Definitive Guide to Logistic Regression in Python

.lazyload-placeholder { display: none; } Introduction Sometimes confused with linear regression by novices - due to sharing the term regression - logistic regression is far different from linear regression. Whi...Learn MorePythonscikit-learndata sciencedata visualizationmatplotlibpandasnumpyseaborn## K-Means Clustering with the Elbow method

.lazyload-placeholder { display: none; } K-means clustering is an unsupervised learning algorithm that groups data based on each point euclidean distance to a central point called centroid. The centroids are de...Learn MorePythonAlgorithmMachine Learningscikit-learndata science## How to Fill NaNs in a Pandas DataFrame

.lazyload-placeholder { display: none; } Missing values are common and occur either due to human error, instrument error, processing from another team, or otherwise just a lack of data for a certain observation...Learn MorePythondata sciencepandasnumpy## Split Train, Test and Validation Sets with Tensorflow Datasets - tfds

Introduction Tensorflow Datasets, also known as tfds is is a library that serves as a wrapper to a wide selection of datasets, with proprietary functions to load, split and prepare datasets for Machine and Deep...Learn MorePythonMachine LearningDeep LearningValidationdata sciencetensorflow## Keras Callbacks: Save and Visualize Prediction on Each Training Epoch

Introduction Keras is a high-level API, typically used with the Tensorflow library, and has lowered the barrier to entry for many and democratized the creation of Deep Learning models and systems. When just sta...Learn MorePythonMachine Learningdata scienceartificial intelligencekerastensorflow## Feature Scaling Data with Scikit-Learn for Machine Learning in Python

Introduction Preprocessing data is an often overlooked key step in Machine Learning. In fact - it's as important as the shiny model you want to fit with it. Garbage in - garbage out. You can have the best mod...Learn MorePythonMachine Learningdata science## Hands-On House Price Prediction - Deep Learning in Python with Keras

In this short series of guides, we'll be taking a look at a hands-on house price prediction. We'll be using Keras, the deep learning API built on top of TensorFlow to train a neural network to predict the price...Learn MorePythonMachine Learningdata scienceartificial intelligencekerastensorflow## Scikit-Learn's train_test_split() - Training, Testing and Validation Sets

Introduction Scikit-Learn is one of the most widely-used Machine Learning library in Python. It's optimized and efficient - and its high-level API is simple and easy to use. Scikit-Learn has a plethora of conve...Learn MorePythonMachine LearningValidationscikit-learndata scienceartificial intelligencetesting## Searching and Replacing Words in Python with FlashText

Introduction In this tutorial, we'll explain how to replace words in text sequences, with Python using the FlashText module, which provides one of the most efficient ways of replacing a large set of words in a ...Learn MorePythondata science## Calculating Spearman's Rank Correlation Coefficient in Python with Pandas

Introduction This guide is an introduction to Spearman's rank correlation coefficient, its mathematical calculation, and its computation via Python's pandas library. We'll construct various examples to gain a b...Learn MorePythondata sciencedata visualizationpandasmathsnumpyseaborn## Guide to Multidimensional Scaling in Python with Scikit-Learn

Introduction In this guide, we'll dive into a dimensionality reduction, data embedding and data visualization technique known as Multidimensional Scaling (MDS). We'll be utilizing Scikit-Learn to perform Mult...Learn MorePythonMachine Learningscikit-learndata scienceartificial intelligencedata visualization