Hudi In Action

Welcome to the code repository for the upcoming book "Hudi In Action". This repository contains hands-on examples, tutorials, and code samples that demonstrate Apache Hudi's capabilities for building robust data lakes.

📖 About the Book

"Hudi In Action" is a comprehensive guide to Apache Hudi, covering everything from basic concepts to advanced production patterns. The book provides practical examples and real-world scenarios to help you master Hudi for your data engineering needs.

🗂️ Repository Structure

hudiinaction/
├── chapter02/                          # Getting Started with Hudi
│   ├── hudi_pipeline_quickstart.scala  # Comprehensive Hudi tutorial
│   ├── trips_0.gz                      # NYC Taxi dataset sample (~1M rows)
│   └── README.md                       # Chapter-specific instructions
└── README.md                           # This file

Each chapter contains its own README with specific learning objectives, setup instructions, and detailed guidance.

📋 Prerequisites

Before running the examples, ensure you have:

Software Requirements

Apache Spark 3.5+ with Scala 2.12
Java 8 or 11
Apache Hudi 1.0.2+ (included via Spark packages)

Hardware Requirements

At least 4GB RAM available for Spark
2+ CPU cores recommended
~2GB disk space for sample data and tables

🚀 Getting Started

Clone this repository
Navigate to the chapter you want to explore
Follow the chapter-specific README for detailed setup instructions
Each chapter is self-contained with its own dataset and examples

🤝 Contributing

Found an issue or want to improve the examples? Contributions are welcome! Please:

Fork the repository
Create a feature branch
Make your changes with clear commit messages
Submit a pull request

📄 License

This code is provided as supplementary material for "Hudi In Action". Please refer to the book's license terms for usage restrictions.

📧 Support

For questions about the book or code examples:

Check the Issues page
Refer to the Apache Hudi documentation
Visit the Apache Hudi community

Happy learning with Apache Hudi! 🎉

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
chapter02		chapter02
chapter03		chapter03
chapter04		chapter04
.gitattributes		.gitattributes
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Hudi In Action

📖 About the Book

🗂️ Repository Structure

📋 Prerequisites

Software Requirements

Hardware Requirements

🚀 Getting Started

🤝 Contributing

📄 License

📧 Support

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

codope/hudiinaction

Folders and files

Latest commit

History

Repository files navigation

Hudi In Action

📖 About the Book

🗂️ Repository Structure

📋 Prerequisites

Software Requirements

Hardware Requirements

🚀 Getting Started

🤝 Contributing

📄 License

📧 Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages