Skip to content

Commit c3f07a0

Browse files
Add 4.0.1 news (#625)
<img width="1203" height="557" alt="Screenshot 2025-09-06 at 21 02 06" src="https://github.com/user-attachments/assets/da6ab796-9b05-42df-8b48-7063d7fedac0" /> <img width="1225" height="542" alt="Screenshot 2025-09-06 at 21 02 16" src="https://github.com/user-attachments/assets/dda8f8d2-97f3-4c0d-afb2-6e40ef6c8d4c" /> <img width="1216" height="372" alt="Screenshot 2025-09-06 at 21 03 45" src="https://github.com/user-attachments/assets/196322f4-4ae5-4f9f-ad1d-9e444a65700f" /> <img width="1224" height="585" alt="Screenshot 2025-09-06 at 21 03 54" src="https://github.com/user-attachments/assets/13818739-8c24-4679-b7dc-c56cb7d24b52" />
1 parent 13a3699 commit c3f07a0

File tree

269 files changed

+1489
-782
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

269 files changed

+1489
-782
lines changed

documentation.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ navigation:
1212
<p>Setup instructions, programming guides, and other documentation are available for each stable version of Spark below:</p>
1313

1414
<ul>
15+
<li><a href="{{site.baseurl}}/docs/4.0.1/">Spark 4.0.1</a></li>
1516
<li><a href="{{site.baseurl}}/docs/4.0.0/">Spark 4.0.0</a></li>
1617
<li><a href="{{site.baseurl}}/docs/3.5.6/">Spark 3.5.6</a></li>
1718
<li><a href="{{site.baseurl}}/docs/3.5.5/">Spark 3.5.5</a></li>

downloads.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -35,7 +35,7 @@ Spark artifacts are [hosted in Maven Central](https://search.maven.org/search?q=
3535

3636
groupId: org.apache.spark
3737
artifactId: spark-core_2.13
38-
version: 4.0.0
38+
version: 4.0.1
3939

4040
### Installing with PyPi
4141
<a href="https://pypi.org/project/pyspark/">PySpark</a> is now available in pypi. To install just run `pip install pyspark`.

js/downloads.js

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ var packagesV14 = [hadoop3p, hadoop3pscala213, hadoopFree, sources];
2323
// 4.0.0+
2424
var packagesV15 = [hadoop34p, hadoop34pSparkConnect, hadoopFree, sources];
2525

26-
addRelease("4.0.0", new Date("05/23/2025"), packagesV15, true);
26+
addRelease("4.0.1", new Date("09/06/2025"), packagesV15, true);
2727
addRelease("3.5.6", new Date("05/29/2025"), packagesV14, true);
2828

2929
function append(el, contents) {
Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
---
2+
layout: post
3+
title: Spark 4.0.1 released
4+
categories:
5+
- News
6+
tags: []
7+
status: publish
8+
type: post
9+
published: true
10+
meta:
11+
_edit_last: '4'
12+
_wpas_done_all: '1'
13+
---
14+
We are happy to announce the availability of <a href="{{site.baseurl}}/releases/spark-release-4-0-1.html" title="Spark Release 4.0.1">Spark 4.0.1</a>! Visit the <a href="{{site.baseurl}}/releases/spark-release-4-0-1.html" title="Spark Release 4.0.1">release notes</a> to read about the new features, or <a href="{{site.baseurl}}/downloads.html">download</a> the release today.
Lines changed: 115 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,115 @@
1+
---
2+
layout: post
3+
title: Spark Release 4.0.1
4+
categories: []
5+
tags: []
6+
status: publish
7+
type: post
8+
published: true
9+
meta:
10+
_edit_last: '4'
11+
_wpas_done_all: '1'
12+
---
13+
14+
Apache Spark 4.0.1 is a maintenance release containing important correctness and stability fixes. This release is based on the branch-4.0 maintenance branch of Spark. We strongly recommend all 4.0 users to upgrade to this stable release.
15+
16+
### Notable changes
17+
18+
- [[SPARK-49872]](https://issues.apache.org/jira/browse/SPARK-49872) Allow unlimited json size again
19+
- [[SPARK-50137]](https://issues.apache.org/jira/browse/SPARK-50137) Avoid fallback to Hive-incompatible ways when table creation fails by thrift exception
20+
- [[SPARK-50748]](https://issues.apache.org/jira/browse/SPARK-50748) Fix a race condition issue which happens when operations are interrupted
21+
- [[SPARK-50889]](https://issues.apache.org/jira/browse/SPARK-50889) Fix a race condition issue which happens when operations are interrupted
22+
- [[SPARK-51430]](https://issues.apache.org/jira/browse/SPARK-51430) Stop PySpark context logger from propagating to stdout
23+
- [[SPARK-51920]](https://issues.apache.org/jira/browse/SPARK-51920) Fix composite/nested structtype in value state for python
24+
- [[SPARK-52023]](https://issues.apache.org/jira/browse/SPARK-52023) Fix data corruption/segfault returning Option[Product] from udaf
25+
- [[SPARK-52146]](https://issues.apache.org/jira/browse/SPARK-52146) Detect cyclic function usage in SQL UDFs
26+
- [[SPARK-52147]](https://issues.apache.org/jira/browse/SPARK-52147) Block temporary object references in persistent SQL UDFs
27+
- [[SPARK-52148]](https://issues.apache.org/jira/browse/SPARK-52148) Fix CREATE OR REPLACE for SQL TVFs
28+
- [[SPARK-52153]](https://issues.apache.org/jira/browse/SPARK-52153) Fix from_json and to_json with variant
29+
- [[SPARK-52237]](https://issues.apache.org/jira/browse/SPARK-52237) Fix the documentation of hypot function
30+
- [[SPARK-52240]](https://issues.apache.org/jira/browse/SPARK-52240) Corrected row index usage when exploding packed arrays in vectorized reader
31+
- [[SPARK-52259]](https://issues.apache.org/jira/browse/SPARK-52259) Fix Param class binary compatibility
32+
- [[SPARK-52265]](https://issues.apache.org/jira/browse/SPARK-52265) Fix regex leading to empty PROCESS_TABLES.testingVersions in HiveExternalCatalogVersionsSuite
33+
- [[SPARK-52267]](https://issues.apache.org/jira/browse/SPARK-52267) Match field id in ParquetToSparkSchemaConverter
34+
- [[SPARK-52287]](https://issues.apache.org/jira/browse/SPARK-52287) Improve `SparkContext` not to populate `o.a.s.internal.io.cloud.*`-related setting if not exist
35+
- [[SPARK-52300]](https://issues.apache.org/jira/browse/SPARK-52300) Make SQL UDTVF resolution use consistent configurations with view resolution
36+
- [[SPARK-52313]](https://issues.apache.org/jira/browse/SPARK-52313) Correctly resolve reference data type for Views with default collation
37+
- [[SPARK-52316]](https://issues.apache.org/jira/browse/SPARK-52316) Upgrade Kafka to 3.9.1
38+
- [[SPARK-52324]](https://issues.apache.org/jira/browse/SPARK-52324) Move Spark docs to the release directory
39+
- [[SPARK-52329]](https://issues.apache.org/jira/browse/SPARK-52329) Remove private sql scoping tags for new transformWithState API
40+
- [[SPARK-52339]](https://issues.apache.org/jira/browse/SPARK-52339) Fix comparison of InMemoryFileIndex instances
41+
- [[SPARK-52345]](https://issues.apache.org/jira/browse/SPARK-52345) Fix NULL behavior in scripting conditions
42+
- [[SPARK-52350]](https://issues.apache.org/jira/browse/SPARK-52350) Fix link for SS programming guide page
43+
- [[SPARK-52384]](https://issues.apache.org/jira/browse/SPARK-52384) Fix bug Connect should insensitive for JDBC options.
44+
- [[SPARK-52386]](https://issues.apache.org/jira/browse/SPARK-52386) Refactor the `HistoryServerSuite` to support regenerating the `expectation.json` files using `SPARK_GENERATE_GOLDEN_FILES=1`
45+
- [[SPARK-52396]](https://issues.apache.org/jira/browse/SPARK-52396) Artifact Root Directory should use tmpdir
46+
- [[SPARK-52397]](https://issues.apache.org/jira/browse/SPARK-52397) Idempotent ExecutePlan: second ExecutePlan with same operationId and plan should reattach
47+
- [[SPARK-52398]](https://issues.apache.org/jira/browse/SPARK-52398) Change ALTER TABLE ALTER COLUMN TYPE STRING not to apply default collation if original data type was instance of StringType
48+
- [[SPARK-52413]](https://issues.apache.org/jira/browse/SPARK-52413) `test_install_spark` switch to Spark 3.5.6
49+
- [[SPARK-52420]](https://issues.apache.org/jira/browse/SPARK-52420) Make test_udtf_with_invalid_return_type compatible with Python only client
50+
- [[SPARK-52421]](https://issues.apache.org/jira/browse/SPARK-52421) Automatically send the RC vote email
51+
- [[SPARK-52450]](https://issues.apache.org/jira/browse/SPARK-52450) Improve performance of schema deepcopy
52+
- [[SPARK-52453]](https://issues.apache.org/jira/browse/SPARK-52453) Automatically release and drop artifacts in Apache Nexus repository
53+
- [[SPARK-52454]](https://issues.apache.org/jira/browse/SPARK-52454) Automatically remove old releases from the mirror
54+
- [[SPARK-52489]](https://issues.apache.org/jira/browse/SPARK-52489) Forbid duplicate SQLEXCEPTION and NOT FOUND handlers inside SQL Script
55+
- [[SPARK-52497]](https://issues.apache.org/jira/browse/SPARK-52497) Add documentation for SQL UDFs
56+
- [[SPARK-52499]](https://issues.apache.org/jira/browse/SPARK-52499) Add more SQL query tests for different data types
57+
- [[SPARK-52516]](https://issues.apache.org/jira/browse/SPARK-52516) Don't hold previous iterator reference after advancing to next file in ParquetPartitionReaderFactory
58+
- [[SPARK-52521]](https://issues.apache.org/jira/browse/SPARK-52521) Right#replacement should not access SQLConf dynamically
59+
- [[SPARK-52529]](https://issues.apache.org/jira/browse/SPARK-52529) Fully upgrade jekyll from 4.3 to 4.4
60+
- [[SPARK-52531]](https://issues.apache.org/jira/browse/SPARK-52531) `OuterReference` in subquery aggregate is incorrectly tied to outer query aggregate
61+
- [[SPARK-52542]](https://issues.apache.org/jira/browse/SPARK-52542) Use `/nonexistent` instead of nonexistent `/opt/spark`
62+
- [[SPARK-52553]](https://issues.apache.org/jira/browse/SPARK-52553) Fix NumberFormatException when reading v1 changelog
63+
- [[SPARK-52562]](https://issues.apache.org/jira/browse/SPARK-52562) Automatically create the base of release notes and push
64+
- [[SPARK-52568]](https://issues.apache.org/jira/browse/SPARK-52568) Fix `exec-maven-plugin` version used by `dev/test-dependencies.sh`
65+
- [[SPARK-52584]](https://issues.apache.org/jira/browse/SPARK-52584) Make build script to support preview releases in finalize step
66+
- [[SPARK-52590]](https://issues.apache.org/jira/browse/SPARK-52590) Add SQL query tests on optional return types
67+
- [[SPARK-52611]](https://issues.apache.org/jira/browse/SPARK-52611) Fix SQLConf version for excludeSubqueryRefsFromRemoveRedundantAliases configuration
68+
- [[SPARK-52612]](https://issues.apache.org/jira/browse/SPARK-52612) Add an env NO_PROVIDED_SPARK_JARS to control collection behavior of sbt/package for spark-avro.jar and spark-protobuf.jar
69+
- [[SPARK-52613]](https://issues.apache.org/jira/browse/SPARK-52613) Restore printing full stacktrace when HBase/Hive DelegationTokenProvider hit exception
70+
- [[SPARK-52684]](https://issues.apache.org/jira/browse/SPARK-52684) Make CACHE TABLE Commands atomic while encounting execution errors
71+
- [[SPARK-52691]](https://issues.apache.org/jira/browse/SPARK-52691) Upgrade ORC to 2.1.3
72+
- [[SPARK-52707]](https://issues.apache.org/jira/browse/SPARK-52707) Remove preview postfix when looking up the JIRA versions
73+
- [[SPARK-52721]](https://issues.apache.org/jira/browse/SPARK-52721) Wrong message parameter for CANNOT_PARSE_DATATYPE
74+
- [[SPARK-52735]](https://issues.apache.org/jira/browse/SPARK-52735) Fix missing error conditions for SQL UDFs
75+
- [[SPARK-52737]](https://issues.apache.org/jira/browse/SPARK-52737) Pushdown predicate and number of apps to FsHistoryProvider when listing applications
76+
- [[SPARK-52741]](https://issues.apache.org/jira/browse/SPARK-52741) RemoveFiles ShuffleCleanup mode doesnt work with non-adaptive execution
77+
- [[SPARK-52749]](https://issues.apache.org/jira/browse/SPARK-52749) Replace preview1 to dev1 in its PyPI package name
78+
- [[SPARK-52753]](https://issues.apache.org/jira/browse/SPARK-52753) Make parseDataType binary compatible with previous versions
79+
- [[SPARK-52776]](https://issues.apache.org/jira/browse/SPARK-52776) Do not split the comm field in ProcfsMetricsGetter
80+
- [[SPARK-52786]](https://issues.apache.org/jira/browse/SPARK-52786) Make pyspark-client package to upload with preview naming
81+
- [[SPARK-52788]](https://issues.apache.org/jira/browse/SPARK-52788) Fix error of converting binary value in BinaryType to XML
82+
- [[SPARK-52791]](https://issues.apache.org/jira/browse/SPARK-52791) Fix error when inferring a UDT with a null first element
83+
- [[SPARK-52799]](https://issues.apache.org/jira/browse/SPARK-52799) Fix ThriftServerQueryTestSuite result comparison
84+
- [[SPARK-52809]](https://issues.apache.org/jira/browse/SPARK-52809) Don't hold reader and iterator references for all partitions in task completion listeners for metric update
85+
- [[SPARK-52828]](https://issues.apache.org/jira/browse/SPARK-52828) Make hashing for collated strings collation agnostic
86+
- [[SPARK-52832]](https://issues.apache.org/jira/browse/SPARK-52832) Fix JDBC dialect identifier quoting
87+
- [[SPARK-52833]](https://issues.apache.org/jira/browse/SPARK-52833) Fix VariantBuilder.appendFloat
88+
- [[SPARK-52870]](https://issues.apache.org/jira/browse/SPARK-52870) Properly quote variable names in FOR statement
89+
- [[SPARK-52873]](https://issues.apache.org/jira/browse/SPARK-52873) Further restrict when SHJ semi/anti join can ignore duplicate keys on the build side
90+
- [[SPARK-52899]](https://issues.apache.org/jira/browse/SPARK-52899) Fix QueryExecutionErrorsSuite test to register H2Dialect back
91+
- [[SPARK-52908]](https://issues.apache.org/jira/browse/SPARK-52908) Prevent for iterator variable name clashing with names of labels in the path to the root of AST
92+
- [[SPARK-52942]](https://issues.apache.org/jira/browse/SPARK-52942) YARN External Shuffle Service jar should include scala-library
93+
- [[SPARK-52976]](https://issues.apache.org/jira/browse/SPARK-52976) Fix Python UDF not accepting collated strings as input param
94+
- [[SPARK-52989]](https://issues.apache.org/jira/browse/SPARK-52989) Add explicit close API to RocksDB State store iterator and fix current usage
95+
- [[SPARK-53020]](https://issues.apache.org/jira/browse/SPARK-53020) JPMS args should also apply to non-SparkSubmit process
96+
- [[SPARK-53054]](https://issues.apache.org/jira/browse/SPARK-53054) Fix the connect.DataFrameReader default format behavior
97+
- [[SPARK-53074]](https://issues.apache.org/jira/browse/SPARK-53074) Avoid partial clustering in SPJ to meet a child's required distribution
98+
- [[SPARK-53094]](https://issues.apache.org/jira/browse/SPARK-53094) Fix CUBE with aggregate containing HAVING clauses
99+
- [[SPARK-53120]](https://issues.apache.org/jira/browse/SPARK-53120) Recover _source directory for PySpark documentation
100+
- [[SPARK-53130]](https://issues.apache.org/jira/browse/SPARK-53130) Fix toJson behavior of collated string types
101+
- [[SPARK-53155]](https://issues.apache.org/jira/browse/SPARK-53155) Global lower agggregation should not be replaced with a project
102+
- [[SPARK-53167]](https://issues.apache.org/jira/browse/SPARK-53167) Spark launcher isRemote also respects properties files
103+
- [[SPARK-53176]](https://issues.apache.org/jira/browse/SPARK-53176) Spark launcher should respect `--load-spark-defaults`
104+
- [[SPARK-53275]](https://issues.apache.org/jira/browse/SPARK-53275) Handle stateful expressions when ordering in interpreted mode
105+
- [[SPARK-53291]](https://issues.apache.org/jira/browse/SPARK-53291) Fix nullability for value column
106+
- [[SPARK-53326]](https://issues.apache.org/jira/browse/SPARK-53326) Upgrade ORC Format to 1.1.1
107+
- [[SPARK-53342]](https://issues.apache.org/jira/browse/SPARK-53342) Fix Arrow converter to handle multiple record batches in single IPC stream
108+
- [[SPARK-53348]](https://issues.apache.org/jira/browse/SPARK-53348) Always persist ANSI value when creating a view or assume it when querying if not stored
109+
- [[SPARK-53360]](https://issues.apache.org/jira/browse/SPARK-53360) Once strategy with ConstantFolding's idempotence should not be broken
110+
- [[SPARK-53394]](https://issues.apache.org/jira/browse/SPARK-53394) UninterruptibleLock.isInterruptible should avoid duplicated interrupt
111+
- [[SPARK-53435]](https://issues.apache.org/jira/browse/SPARK-53435) Fix race condition in CachedRDDBuilder
112+
113+
You can consult JIRA for the [detailed changes](https://s.apache.org/spark-4.0.1).
114+
115+
We would like to acknowledge all community members for contributing patches to this release.

site/404.html

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -178,6 +178,9 @@ <h2 id="oops">Oops!</h2>
178178
<h5>Latest News</h5>
179179
<ul class="list-unstyled">
180180

181+
<li><a href="/news/spark-4-0-1-released.html">Spark 4.0.1 released</a>
182+
<span class="small">(Sep 06, 2025)</span></li>
183+
181184
<li><a href="/news/spark-4-1-0-preview1-released.html">Preview release of Spark 4.1.0</a>
182185
<span class="small">(Sep 02, 2025)</span></li>
183186

@@ -187,9 +190,6 @@ <h5>Latest News</h5>
187190
<li><a href="/news/spark-4-0-0-released.html">Spark 4.0.0 released</a>
188191
<span class="small">(May 23, 2025)</span></li>
189192

190-
<li><a href="/news/spark-3-5-5-released.html">Spark 3.5.5 released</a>
191-
<span class="small">(Feb 27, 2025)</span></li>
192-
193193
</ul>
194194
<p class="small" style="text-align: right;"><a href="/news/index.html">Archive</a></p>
195195
</div>

site/committers.html

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -725,6 +725,9 @@ <h3>Policy on backporting bug fixes</h3>
725725
<h5>Latest News</h5>
726726
<ul class="list-unstyled">
727727

728+
<li><a href="/news/spark-4-0-1-released.html">Spark 4.0.1 released</a>
729+
<span class="small">(Sep 06, 2025)</span></li>
730+
728731
<li><a href="/news/spark-4-1-0-preview1-released.html">Preview release of Spark 4.1.0</a>
729732
<span class="small">(Sep 02, 2025)</span></li>
730733

@@ -734,9 +737,6 @@ <h5>Latest News</h5>
734737
<li><a href="/news/spark-4-0-0-released.html">Spark 4.0.0 released</a>
735738
<span class="small">(May 23, 2025)</span></li>
736739

737-
<li><a href="/news/spark-3-5-5-released.html">Spark 3.5.5 released</a>
738-
<span class="small">(Feb 27, 2025)</span></li>
739-
740740
</ul>
741741
<p class="small" style="text-align: right;"><a href="/news/index.html">Archive</a></p>
742742
</div>

site/community.html

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -374,6 +374,9 @@ <h3>Powered by</h3>
374374
<h5>Latest News</h5>
375375
<ul class="list-unstyled">
376376

377+
<li><a href="/news/spark-4-0-1-released.html">Spark 4.0.1 released</a>
378+
<span class="small">(Sep 06, 2025)</span></li>
379+
377380
<li><a href="/news/spark-4-1-0-preview1-released.html">Preview release of Spark 4.1.0</a>
378381
<span class="small">(Sep 02, 2025)</span></li>
379382

@@ -383,9 +386,6 @@ <h5>Latest News</h5>
383386
<li><a href="/news/spark-4-0-0-released.html">Spark 4.0.0 released</a>
384387
<span class="small">(May 23, 2025)</span></li>
385388

386-
<li><a href="/news/spark-3-5-5-released.html">Spark 3.5.5 released</a>
387-
<span class="small">(Feb 27, 2025)</span></li>
388-
389389
</ul>
390390
<p class="small" style="text-align: right;"><a href="/news/index.html">Archive</a></p>
391391
</div>

site/contributing.html

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -721,6 +721,9 @@ <h2>Code of conduct</h2>
721721
<h5>Latest News</h5>
722722
<ul class="list-unstyled">
723723

724+
<li><a href="/news/spark-4-0-1-released.html">Spark 4.0.1 released</a>
725+
<span class="small">(Sep 06, 2025)</span></li>
726+
724727
<li><a href="/news/spark-4-1-0-preview1-released.html">Preview release of Spark 4.1.0</a>
725728
<span class="small">(Sep 02, 2025)</span></li>
726729

@@ -730,9 +733,6 @@ <h5>Latest News</h5>
730733
<li><a href="/news/spark-4-0-0-released.html">Spark 4.0.0 released</a>
731734
<span class="small">(May 23, 2025)</span></li>
732735

733-
<li><a href="/news/spark-3-5-5-released.html">Spark 3.5.5 released</a>
734-
<span class="small">(Feb 27, 2025)</span></li>
735-
736736
</ul>
737737
<p class="small" style="text-align: right;"><a href="/news/index.html">Archive</a></p>
738738
</div>

site/developer-tools.html

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -707,6 +707,9 @@ <h2>Generative tooling usage</h2>
707707
<h5>Latest News</h5>
708708
<ul class="list-unstyled">
709709

710+
<li><a href="/news/spark-4-0-1-released.html">Spark 4.0.1 released</a>
711+
<span class="small">(Sep 06, 2025)</span></li>
712+
710713
<li><a href="/news/spark-4-1-0-preview1-released.html">Preview release of Spark 4.1.0</a>
711714
<span class="small">(Sep 02, 2025)</span></li>
712715

@@ -716,9 +719,6 @@ <h5>Latest News</h5>
716719
<li><a href="/news/spark-4-0-0-released.html">Spark 4.0.0 released</a>
717720
<span class="small">(May 23, 2025)</span></li>
718721

719-
<li><a href="/news/spark-3-5-5-released.html">Spark 3.5.5 released</a>
720-
<span class="small">(Feb 27, 2025)</span></li>
721-
722722
</ul>
723723
<p class="small" style="text-align: right;"><a href="/news/index.html">Archive</a></p>
724724
</div>

0 commit comments

Comments
 (0)