Code Experts

pyspark RDD Second Assignment

For The Below File Given Link WIth Path Please Explore The FIle Using RDD
NOTE :=> please import pyspark and Create SparkSession and SParkContex Then Solve The challenge
File Path :=> file:///root/data/olympix_data_organized_without_header - Copy.csv

# Dataset Description

This data set consists of the following fields:

***Athlete [0]***: Name of the athlete

***Age [1]***: Age of the athlete
8i9o-p
***Country [2]***: The name of the country participating in Olympics

***Year [3]***: The year in which Olympics is conducted

***Closing Date [4]***: Closing date of Olympics

***Sport [5]***: Sports name

***Gold Medals [6]***: No. of gold medals

***Silver Medals [7]***: No. of silver medals

***Bronze Medals [8]***: No. of bronze medals

***Total Medals [9]***: Total no. of medals
"""

Question1 :=>

How many unique professions do we have in the data file?

Code :

Test

Total : 0 Discussion