If you’re reading this, you and I have lots of common interests.
I was daunted by the complexities of projects when I started my data science career. We were using Virutalenv in all our python projects.
I’m impressed by the Node Package Manager (npm) and always wondered why we don’t have one like that in Python.
I was yearning for a single tool to maintain isolated environments, manage dev and production dependencies, packaging, and publishing.
Thankfully, we have Poetry now.
In a nutshell, Poetry is a tool for dependency management and packaging in Python.
But this official definition is incomplete because I found Poetry does more than managing dependencies and packaging.
Describe was the first function I try on any new dataset. But I found a better one now.
I replaced it with Skimpy. It’s a small python package that shows some extended summary results for a dataset. You can also run it on a terminal window without entering a Python shell.
You can install it from PyPI using the following command.
pip install skimpy
In a previous post, I’ve shared three Python exploratory data analysis tools. With them, you can generate more complete reports about your datasets in the blink of an eye.
But what if you need a simpler…
Dash apps allow you to build interactive dashboards purely in Python. Interestingly, it could reach heights that popular BI platforms can not. Also, you can host it on your servers and your terms.
BI Platforms such as Tableau and PowerBI do a fantastic job. It allows even non-technical managers to do data exploration themselves. …
I trust the first profound attempt was in 1985. A revolutionary software changed the way we think about data. It allowed ordinary people to do extraordinary data analyses. We call it Excel, developed by Microsoft initially for Machintosh.
Since then, the field of data science has evolved and become accessible for everyone.
If you have dreams of becoming a data scientist or a data engineer, you’d probably see a black screen full of codes in that dream. Polishing your coding skills may be the popular advice you get on this journey. Yet, surprisingly, it has nothing to do with programming.
Data science is the process of making sense from a raw collection of records. A programing language is only a tool. It’s like a container for cooking your meals. But the container itself is not the meal.
People lose interest in data science because some aren’t good at programming. They couldn’t get…
I was a big fan of Apache Airflow. Even today, I don’t have many complaints about it. But the new technology Prefect amazed me in many ways, and I can’t help but migrating everything to it.
Prefect (and Airflow) is a workflow automation tool. You can orchestrate individual tasks to do more complex work. You could manage task dependencies, retry tasks when they fail, schedule them, etc.
I trust workflow management is the backbone of every data science project. Even small projects can have remarkable benefits with a tool like Prefect. It eliminates a significant part of repetitive tasks. …
I remember the good old college days where we spent weeks analyzing survey data in SPSS. It’s interesting to see how far we came from that point.
Today, we do all of them and a lot more in a single command before you even blink.
That’s a remarkable improvement!
This short article will share three impressive Python libraries for exploratory data analysis (EDA). Not a Python pro? Don’t worry! You can benefit from these tools even if you know nothing about Python.
They could save weeks of your data exploration and improve its quality. …
Hi Bex T., thanks for the great question. It's indeed simple.
Here's how to do it on Linux. I apologize; I don't use Windows often. But I believe the method could be similar.
Create a file and name it the way you need the API. For illustration, I've created one called
hello. Note that it doesn't have any file type extension such as .py or .sh.
The following is how its content should look like.
Note the first line
#! /usr/bin/python . This line will tell the OS which executor to use when running the script…
In a previous article, I wrote about the limitations of using Python web apps for analytics projects. Some of the points kindled readers' curiosity and inspired me to write another story to complement it.
The central question of this article is, “if Python has severe drawbacks because of its sync behavior, how do platforms such as Instagram and Spotify use it to serve millions around the world?”
While I have no official information from these platforms (or similar ones), I have insights into handling such massive requests from my experience.
Here in this article, I prepared a demo to show…