For The Below File Given Link WIth Path Please Explore The FIle Using RDD
NOTE :=> please import pyspark and Create SparkSession and SParkContex Then Solve The challenge
File Path :=> file:///root/data/olympix_data_organized_without_header - Copy.csv
# Dataset Description
This data set consists of the following fields:
***Athlete [0]***: Name of the athlete
***Age [1]***: Age of the athlete
8i9o-p
***Country [2]***: The name of the country participating in Olympics
***Year [3]***: The year in which Olympics is conducted
***Closing Date [4]***: Closing date of Olympics
***Sport [5]***: Sports name
***Gold Medals [6]***: No. of gold medals
***Silver Medals [7]***: No. of silver medals
***Bronze Medals [8]***: No. of bronze medals
***Total Medals [9]***: Total no. of medals
"""
Question1 :=>
How many unique professions do we have in the data file?