Title: Data Engineer Jobs Timestamp (or timespan) of the dataset(dd/mm/yyyy): 17/10/2022 to 29/01/2023 Data source (and related link): https://kaggle.com Link to the dataset (or a set of datasets): data engineer jobs: https://www.kaggle.com/datasets/andrewmvd/data-engineer-jobs longitude and latitude for the states: https://www.kaggle.com/datasets/paultimothymooney/latitude-and-longitude-for-every-country-and-state Link to a file containing the metadata of the dataset: https://github.com/Hanuro/data-engineer-jobs-analysis/blob/main/research/assets/dataviz/dataset/metadata.txt Short abstract of the data visualization process: The data visualization process was made through the use of python and libraries such seaborn, plotly, pandas, matplotlib.pyplot, numpy, jupyter this last one was mainly used for the organization of the code An ordered list of all the actions performed: [Importing of the libraries] [Pre-processing such as reading the dataset with pandas and cleaining through the use of its internal functions to check for nan values and duplicates.] [Deletion of useless columns that had too many nan values] [Filling the nan values through kNN s.t. we don't mess up the visualization] [Checking if the kNN was successful through some internal plots] [Filling string columns's nan values with the most popular one to not have nan values in the visualizations] [Coding of the visualization through the use of plotly documentation and pandas]