DBSCAN Clustering Algorithm

DBSCAN is an example of a clustering algorithm. Clustering is the process of organizing a collection of concrete or abstract things into classes of related objects. A cluster is a group of data objects that are similar to one another within the same cluster yet distinct from the objects in other clusters. There are various example […]

Vector Space Model in NLP

Let's begin with the fundamentals of the vector space model. All NLP applications require a basic understanding of how word vectors are represented in natural language processing, because the first step in creating any NLP model is to express each word as some kind of number or code. For instance, a word can be represented as a […]

Making People Search Tool in 2022 Using BeautifulSoup

Hello and welcome back, everyone. In this part of the blog, I am going to share how we can create our own people search tool using BeautifulSoup and Python. We will leverage the Google search engine to search for people's profiles based on a keyword, and we will use GitHub’s search to do people […]

Basics of Natural Language Processing (NLP) Pipeline

Images in this NLP blog are borrowed from the lecture notes of Prof. Bal Krishna Bal. A natural language processing pipeline is the end-to-end structure that orchestrates the flow of data into, and output from, an NLP model (or a set of multiple models). It covers the input of the raw data, the features, the […]

Approaches for Natural Language Processing (NLP)

Several widely used NLP algorithms. Images in this blog are borrowed from the lecture notes of Prof. Bal Krishna Bal. Many algorithms exist not only for machine learning tasks but also for natural language processing tasks. The choice of which algorithm is best for a given problem cannot be made with absolute certainty. After determining […]

Basic of Natural Language Processing (NLP)

Basic Introduction of Natural Language Processing (NLP). Images in this blog are borrowed from the lecture notes of Prof. Bal Krishna Bal. NLP is short for natural language processing. It is a sub-field of artificial intelligence. The field grew out of the study of how to program computers to comprehend and use natural language. […]

Kruskal Wallis H Test in News Data

Kruskal Wallis H Test. What is the Kruskal Wallis H test? The Kruskal Wallis H test is a non-parametric test, which means it makes no assumption about the parameters of the parent population from which the sample was taken, and that population need not be normally distributed. The Kruskal Wallis H test is also known as the non-parametric version of […]

Apache Superset for COVID Dashboards

Apache Superset is a very useful and easy-to-use visualization and dashboard-making tool that can be an alternative to tools like Tableau and Power BI. In this blog, we will explore how we can create awesome data dashboards using Apache Superset with little to no code at all. But there are a few things one should do […]

Multilayer Perceptron: Solving XOR Problem

Multilayer Perceptron: Solving XOR Problem from Scratch in Python. In this blog, we are going to explore how a non-linear problem like XOR can be solved using a multilayer perceptron. We have already applied a multilayer perceptron to the majority function; please have a look here. We are all familiar with the single-layer perceptron […]

Taking Data Apps into WebApp: Using Streamlit, Plotly, and Python

Introduction. In the past two stories of data and its journey to confess its insights, we explored several areas. To point out a few: we did EDA based on the descriptive and inferential parts of statistics to find strong evidence, relationships, and facts about the data. We used some of the valuable insights […]
