Home

Fun with Data | 21.1.23

Visualizing Data about the
Production of Coffee Beans

So this is my first Data Science project ever and there is no method but total anarchy, lots of trial and mostly error.

Dataset is from: https://www.kaggle.com/datasets/yamaerenay/ico-coffee-dataset-worldwide


here we go:
Production of Coffee Beans

Attempt #1

Here is my first ever Data Chart! Its a small part of the original data Set I used. I am Starting to learn Data Science on Jupyter Notebook.

The look is ok but I am only using a fraction of a dataset that i typed from hand so i need to learn how to access a dataset directly without copying individual values into new series.

Attempt #2

Ok I figured out a way to turn a part of the dataframe into a list which in turn i can plot. this method is really not handy at all but hey progress.

Next I want to try to show multiple countries in the same plot

Attempt #3

Since Production in Brazil is magnitudes higher than in the other countries in the plot it becomes really unclear.

Maybe try a logarithmic y-scale.

Attempt #4

Oh no this is just bad.

Let's call it a day and go study.

Conclusion

I figured out how to do some v simple plots in Jupiter. Thats at least something. Next I will learn about Data Analysis.
I should start my next Data Project with a thesis because why plot something with no point to proove.
:)

Good friend ChatGPT helped with the css file