Skip to content

Releases: Eventual-Inc/Daft

v0.6.7

20 Oct 05:37
a45d0de

Choose a tag to compare

What's Changed 🚀

💥 Breaking Changes

✨ Features

🐛 Bug Fixes

🚀 Performance

♻️ Refactor

📖 Documentation

👷 CI

🔧 Maintenance

Full Changelog: v0.6.6...v0.6.7

v0.6.6

14 Oct 20:24
ced9da7

Choose a tag to compare

What's Changed 🚀

💥 Breaking Changes

✨ Features

🐛 Bug Fixes

🚀 Performance

  • perf(flotilla): Throttle worker refresh and autoscaling @colin-ho (#5351)
  • perf: Elide shuffle for distinct if input is already partitioned @colin-ho (#5354)
  • perf: use bincode instead of python for io_conf serialization in FileArray @universalmind303 (#5340)
  • perf: Only Serialize Required Cols in Process UDFs @srilman (#5069)

📖 Documentation

👷 CI

Full Changelog: v0.6.5...v0.6.6

v0.6.5

06 Oct 17:48
9ebbc1a

Choose a tag to compare

What's Changed 🚀

💥 Breaking Changes

✨ Features

🐛 Bug Fixes

🚀 Performance

♻️ Refactor

📖 Documentation

👷 CI

🔧 Maintenance

  • chore: Refactor DistributedPipelineNode to implement TreeDisplay @srilman (#5315)
  • chore: Enable interactive html for df.__repr_html__ @colin-ho (#5312)

Full Changelog: v0.6.4...v0.6.5

v0.6.4

01 Oct 00:00
dae715c

Choose a tag to compare

What's Changed 🚀

💥 Breaking Changes

✨ Features

🐛 Bug Fixes

📖 Documentation

✅ Tests

🔧 Maintenance

Full Changelog: v0.6.3...v0.6.4

v0.6.3

24 Sep 23:29
153a726

Choose a tag to compare

What's Changed 🚀

✨ Features

🐛 Bug Fixes

  • fix: Increase actor udf readiness timeout @colin-ho (#5258)
  • fix: Ensure super extension type is registered in to_arrow_dtype @ConeyLiu (#5265)

🚀 Performance

♻️ Refactor

  • refactor: add fragment_group_size to reduce lance scan task @Jay-ju (#5261)

📖 Documentation

🔧 Maintenance

⬆️ Dependencies

Full Changelog: v0.6.2...v0.6.3

v0.6.2

19 Sep 20:28
10086b2

Choose a tag to compare

What's Changed 🚀

✨ Features

🐛 Bug Fixes

🚀 Performance

📖 Documentation

👷 CI

🔧 Maintenance

  • chore: Don't log failures in url download if on_error="null" @colin-ho (#5231)
  • chore: Make the runner a separate global singleton @srilman (#5185)
  • chore: remove expr.str.regexp_split since we will immediately deprecate it @kevinzwang (#5216)
  • chore: Rewrite the dashboard server to use Axum @srilman (#5212)
  • chore: Remove reset_runner option @srilman (#5184)
  • chore: Split .str.split into .str.split and .str.regexp_split @srilman (#5211)
  • chore: enable logging configuration during launch udf worker @stayrascal (#5168)

⏪ Reverts

Full Changelog: v0.6.1...v0.6.2

v0.6.1

10 Sep 17:37
778eba6

Choose a tag to compare

What's Changed 🚀

✨ Features

🐛 Bug Fixes

🚀 Performance

  • perf: Use count_rows on fragment to reduce lance scans with limit pushdowns only @colin-ho (#5120)

📖 Documentation

  • docs: improve text readability on examples page @ykdojo (#5182)
  • docs: add TrendShift badge to README @ykdojo (#5181)
  • docs: improve explode method documentation with null/empty list examples @ykdojo (#5164)
  • docs: fix broken tutorial links and remove redundant file @ykdojo (#5154)

👷 CI

🔧 Maintenance

  • chore: add PR conventions note to AGENTS.md @ykdojo (#5166)

Full Changelog: v0.6.0...v0.6.1

v0.6.0

04 Sep 04:26
ef36735

Choose a tag to compare

What's Changed 🚀

v0.6.0 marks the official release of our new ray-based distributed engine, Flotilla! If you are already using the ray runner, you do not need to change anything. Setting the DAFT_RUNNER=ray environment variable, or within your python program via daft.context.set_runner_ray(), will use Flotilla by default.

All operations except cross join, sort merge join, and pivot are currently supported. We will be working on adding support for them soon! If you need to use the legacy ray runner, please set daft.set_execution_config(use_legacy_ray_runner=True)

💥 Breaking Changes

SQLCatalog was deprecated in v0.5 and is now removed, in favor of the bindings kwargs.

Before:

catalog = SQLCatalog({"test_data": df})
result = daft.sql("SELECT * FROM test_data", catalog=catalog)

After:

bindings = {"test_data": df}
result = daft.sql("SELECT * FROM test_data", **bindings)

✨ Features

🐛 Bug Fixes

🚀 Performance

♻️ Refactor

  • refactor: proper organization for providers and protocols in daft.ai @rchowell (#5125)

📖 Documentation

👷 CI

🔧 Maintenance

⏪ Reverts

Full Changelog: v0.5.22...v0.5.23

v0.5.22

28 Aug 15:00
248b7e8

Choose a tag to compare

What's Changed 🚀

💥 Breaking Changes

✨ Features

🐛 Bug Fixes

♻️ Refactor

📖 Documentation

👷 CI

🔧 Maintenance

  • chore: Remove deprecated APIs for 0.6 @colin-ho (#5050)
  • chore: disable hugging face library progress bars @kevinzwang (#5040)
  • chore: relax assertion in flaky sharding distribution test @Jay-ju (#5053)
  • chore(dev): use pyproject.toml to manage the dev dependencies @xy-xin (#4849)
  • chore: random the counter during creating DistributedActorPoolProject… @stayrascal (#5039)

⏪ Reverts

Full Changelog: v0.5.21...v0.5.22

v0.5.21

21 Aug 21:57
9017c8f

Choose a tag to compare

What's Changed 🚀

✨ Features

🐛 Bug Fixes

📖 Documentation

🔧 Maintenance

Full Changelog: v0.5.20...v0.5.21