Published in Dev Genius
Different Storage Levels in Spark with Illustrations
In this article I would like to cover
1d ago

Published in Dev Genius
Shuffle Partitions and DataFrame Partitioning Are Not The Same
Since starting my journey in data engineering, I have always heard this word ‘partition(ing)’ and for valid reasons. In simple terms, it is…
Feb 20

Published in Dev Genius
Why Use Surrogate Keys in Your Data Warehouse?
What is a Surrogate Key?
Feb 14

How to Retrieve Match Statistics using Network Requests and Python’s Requests Library
Getting all match stats can be a daunting task, especially if you are an Indian football fan who doesn’t have access to high-quality data…
Apr 2, 2023

Getting Started With Indian Super League Data
In this piece, I’ll walk you through how you can get started with data analysis specific to Indian Football. We’ll use Python and Jupyter…
Oct 6, 2022

Getting Started with Google Tag Manager — Learn How Websites Collect Data
This article was published as a part of the Data Science Blogathon.
Sep 6, 2022

The Bayes Theorem & Football
This article was published as a part of the Data Science Blogathon conducted by analyticsvidhya.com
Jul 30, 2022

How I created an Expected points table for Indian Super League using Python
In my previous article, I covered how I scraped stats such as xG, xG open play from fotmob.com. I’ll be using the data collected from that…
Mar 27, 2022

Scraping Advanced Football Stats for the Indian Super League using Python
Indian Super League (also referred to as ISL) is one of the budding leagues in Asia. According to me, the league has the potential to…
Mar 17, 2022