HowTo

Text Classification with Machine Learning Using Udemy Dataset and Python

In this tutorial, which is part of the End-To-End Data Science Project using the Udemy Dataset, we will perform text classification using the course title and the subject category. Our aim is to predict the subject category given the course title. By the end of this tutorial you will learn What we …

Sentiment Analysis in Golang

In this tutorial, which is part of the Go4DataScience and Go4NLP series in which we experiment with Go for data science, we will perform sentiment analysis in Golang. By the end of this tutorial you will understand and learn about: what sentiment analysis is; why sentiment analysis; its applications; how to perform sentiment …

Building Machine Learning Pipelines with Scikit Learn & Python

The everyday data science/ML workflow follows a particular pattern: taking in data, analyzing it, and then deriving useful insights to build problem-solving and predictive tools. One of the simplest and most useful ways of working with such a workflow is to use pipelines. So what is a Pipeline? Why is it …

Building A Fake Data Generator App with Streamlit and Python Faker

Data is everywhere, but sometimes when building a website or testing a product you need quick data to test-drive your app or product. This is where the Faker library comes into play. The idea behind Faker is quite simple: generate random data for given fields. This library has …

How to Save Uploaded Files to Directory in Streamlit Apps

Some time ago we learnt how to upload files and process them in Streamlit. We dived deeper into the Streamlit UploadedFile class and how it reads the upload as a file-like BytesIO type. In this tutorial we will learn how to save an uploaded file into a directory. This is useful in …
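
As a rough illustration of the idea, here is a minimal sketch of saving a file-like upload into a directory. It is not the tutorial's own code: the helper name `save_uploaded_file` and the `FakeUpload` stand-in are hypothetical, and the sketch only assumes the upload behaves like a BytesIO object with a `name` attribute.

```python
import io
from pathlib import Path


def save_uploaded_file(uploaded_file, directory):
    """Write a file-like upload into `directory` and return the saved path.

    Hypothetical helper: assumes `uploaded_file` has a `name` attribute
    and a BytesIO-style `getbuffer()` method.
    """
    target_dir = Path(directory)
    # Create the destination directory if it does not exist yet.
    target_dir.mkdir(parents=True, exist_ok=True)
    # Keep only the final path component of the supplied file name.
    target_path = target_dir / Path(uploaded_file.name).name
    with open(target_path, "wb") as out:
        out.write(uploaded_file.getbuffer())
    return target_path


class FakeUpload(io.BytesIO):
    """Hypothetical stand-in for an uploaded file: raw bytes plus a name."""

    def __init__(self, data, name):
        super().__init__(data)
        self.name = name
```

In a Streamlit app, the object returned by `st.file_uploader` could be passed to a helper like this once it is not `None`; the destination directory is whatever the app chooses.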

Using Go For Data Science. Is it Easy?

Go, also known as Golang, is a simple yet powerful programming language created around 2009 and backed by Google. Go is a fast, statically typed, compiled language that feels like a dynamically typed, interpreted language. With many features built in, it is a great tool. In this series of tutorials and posts we …

PyPolars – Data Analysis with PyPolars – a Pandas Alternative

Pandas is one of the best libraries for doing data analysis in general. It is so popular and useful that it has become the de facto DataFrame library for data science in Python. However, there are times when your dataset may be too big for Pandas and too small for PySpark – this is where …
