HowTo

An Intro to DagsHub For Data Scientists

DagsHub is a platform for data scientists and machine learning engineers to version their data, models, experiments, and code. In a way it is like Github for Data Scientists and ML Engineers. It is where Data Scientists build their ML projects without any external and complicated devops. It seeks to simplify MLOps for everyone. It allows …

An Intro to DagsHub For Data Scientists Read More »

Hash Identification Using Machine Learning and 3 Tools

A hash is unique fixed-size values or a fixed-length string of characters produced by a hash function. It is the values returned by a hash algorithm. Such values can also be termed as Message Digests or simply a Hash. A hash function is any function than can be used to map data of arbitrary size …

Hash Identification Using Machine Learning and 3 Tools Read More »

Natural Language Processing with Spark NLU

Natural Language Processing ( NLP for short) is an exciting and useful field of Data Science. Some of applications of NLP involves Text Classification Sentiment Analysis Machine Translation Chatbots Creation Keyword Extraction NER etc With the increase of textual data, comes the increase of performant and fine tuned State of the Art(SoTA) models. These models …

Natural Language Processing with Spark NLU Read More »

How to Develop and Publish Julia Packages For Beginners

Julia is an exciting and fascinating programming language like Python and R and Go. It offers several benefit if you are into scientific computing and Datascience. In this tutorial we will explore how to develop and publish Julia Packages. We will using this simple workflow for designing our package. This describes everything that we will …

How to Develop and Publish Julia Packages For Beginners Read More »

How to Programmatically Label Datasets using Snorkel in Python

Machine Learning can be grouped into two based on the presence or absence of labels or target class. This gives us the main branches of supervised machine learning (where the dataset is provided with predefined labels) and unsupervised machine learning (where there are no labels given). The task of assigning labels to a dataset is …

How to Programmatically Label Datasets using Snorkel in Python Read More »

How to Generate Documentations with MkDocs

Documentations are essential for building softwares, in fact documentation is also part of the software you design and develop. It is what differentiate a good software from not a good one. In this tutorial we will be learning how to use a nice library called mkdocs to generate documentations for our software project. As the …

How to Generate Documentations with MkDocs Read More »

How to Visualize Neural Networks

Neural Networks or preferably called Artificial neural networks forms the building block for Deep learning activities. In this article, we will explore the various ways we can visualize your neural network architecture. We will be using 4 different methods to help us visualize the components of a neural network. By the end of this tutorial …

How to Visualize Neural Networks Read More »

Data Analysis with JavaScript Using Data-Forge

Python is popular in the data science world. It has several libraries and packages for performing data analysis, data visualization, machine learning,deep learning and also for converting your models into product. Certain libraries like Pandas, Scikit-learn, Tensorflow, PyTorch, Flask, Streamlit and Plotly made Python very useful for Data Scientist and Data Engineers. JavaScript on the …

Data Analysis with JavaScript Using Data-Forge Read More »