pokemon:
- Data analysis on a simple dataset representing statistics about Pokemons
wordcount:
- Map-Reduce task that counts words in a text
- MapReduce job that associates each word with the most frequent word that follows it
- MapReduce jobs counting the number of unique words in a book
sp500-analysis: Data preprocessing and analysis using Spark:
- Describe the data: What does the data contain? What are the values of the classical metrics (min, max, average, standard deviation, ...)?
- Insights: Year in which the stock market varied the most, Company with highest VolumeExchanged/SharesOutstanding for each year...