August 22, 2025: Weekly Status Update in Gluten #10510
                  
                    
                      GlutenPerfBot
                    
                  
                
                  started this conversation in
                General
              
            Replies: 0 comments
  
    Sign up for free
    to join this conversation on GitHub.
    Already have an account?
    Sign in to comment
  
        
    
Uh oh!
There was an error while loading. Please reload this page.
-
This weekly update is generated by LLMs. You're welcome to join our Github for in-depth discussions.
Overall Activity Summary
This week saw vibrant activity in the Gluten community, with a strong focus on enhancing data lake capabilities, optimizing core performance, and modernizing the codebase. Development was heavily concentrated on the Velox backend, with significant progress in Iceberg write support and shuffle performance. Additionally, a major effort to simplify the project by removing legacy hardware accelerator support is underway. The Flink integration continues to mature with new features, and proactive work on Spark 4.0 compatibility ensures the project stays future-ready.
Key Ongoing Projects
Several major initiatives are pushing the project forward, driven by dedicated contributors:
ColumnarShuffleReaderto merge input streams and improve performance for sort-based shuffles.Priority Items
We encourage the community to review and provide feedback on these important pull requests that are currently open:
Notable Discussions
Several important conversations are shaping the future of Gluten:
Emerging Trends
Based on this week's activity, we've identified several key trends:
Good First Issues
Looking to make your first contribution to Gluten? These issues are well-defined and a great way to get started:
date_from_unix_datefunction in the ClickHouse backend.split_partfunction for the ClickHouse backend.SparkPartitionIDfunction in the ClickHouse backend.MakeYMIntervalexpression for the ClickHouse backend.These issues are excellent entry points for contributors with some C++ and Scala/Java experience. They involve implementing a single, well-scoped function, allowing you to get familiar with the codebase and contribution process without needing to understand the entire system. Welcome to the community
Beta Was this translation helpful? Give feedback.
All reactions