[DB] use queue pool for async db engine #8005

rohansonecha · 2025-11-18T19:24:43Z

Addresses #7829 (review). The main benefit for using QueuePools as compared to NullPools is that connection pooling allows maintaining long running connections in memory for efficient re-use (ref).

Tested with local postgres (NullPools), initialized the sync engine as well as the async engine. Sync engine is initialized by most normal operations like sky launch, sky status, etc. Async engine is used by provision logs for checking the cluster status, so I ran provision logs during cluster launch to verify proper async engine initialization.

I ran the same tests described above with a remote api server and a beefy cloud sql instance to force using QueuePools and veriifed that both sync and async engine initialization works as expected.

Tested (run the relevant ones):

Code formatting: install pre-commit (auto-check on commit) or bash format.sh
Any manual or new tests for this PR (please specify below)
All smoke tests: /smoke-test (CI) or pytest tests/test_smoke.py (local)
Relevant individual tests: /smoke-test -k test_name (CI) or pytest tests/test_smoke.py::test_name (local)
Backward compatibility: /quicktest-core (CI) or pytest tests/smoke_tests/test_backward_compat.py (local)

rohansonecha · 2025-11-18T19:31:37Z

/quicktest-core
/smoke-test
/smoke-test --postgres

rohansonecha · 2025-11-18T21:20:38Z

/smoke-test -k test_managed_jobs_basic

rohansonecha · 2025-11-18T21:21:01Z

/smoke-test --postgres -k test_managed_jobs_logs_gc

rohansonecha · 2025-11-18T21:22:27Z

/smoke-test --postgres -k test_nonexistent_bucket
/smoke-test --postgres -k test_upload_to_existing_bucket

rohansonecha · 2025-11-18T23:28:00Z

/smoke-test -k test_managed_jobs_basic
/smoke-test --postgres -k test_managed_jobs_logs_gc
/smoke-test --postgres -k test_nonexistent_bucket
/smoke-test --postgres -k test_upload_to_existing_bucket

Michaelvll · 2025-11-18T23:37:24Z

Can we also mention what is the major benefits we are getting from this?

rohansonecha · 2025-11-19T01:24:17Z

/smoke-test --postgres -k test_managed_jobs_logs_gc
/smoke-test --postgres -k test_nonexistent_bucket
/smoke-test --postgres -k test_upload_to_existing_bucket

rohansonecha · 2025-11-19T06:03:32Z

/smoke-test --postgres -k test_managed_jobs_logs_gc
/smoke-test --postgres -k test_nonexistent_bucket
/smoke-test --postgres -k test_upload_to_existing_bucket

rohansonecha · 2025-11-19T19:48:11Z

Can we also mention what is the major benefits we are getting from this?

Updated the PR description

SeungjinYang

Thanks @rohansonecha! Couple of minor comments, but this should be a meaningful improvement to DB efficiency

sky/utils/db/db_utils.py

SeungjinYang · 2025-11-19T23:49:28Z

sky/utils/db/db_utils.py

-                            conn_string, poolclass=sqlalchemy.pool.NullPool))
+                    kw_args = {'poolclass': sqlalchemy.NullPool}
+                    if async_engine:
+                        _postgres_engine_cache[conn_string] = (


I do understand you don't need a separate engine cache for async and sync engines because their conn str would be different, but I actually suggest separating them out on a readability argument (so there's no second guessing involved at all)

Eh, I thought this would be easy enough because I thought all cases where code gets / sets from cache is if / elsed on sync / async anyway, but that's not the case. I'm fine with you adding a comment somewhere in code on why it's safe to use the same cache across sync and async engines.

Discussed offline, but I think adding a separate engine cache would clutter the logic in this code block. I added a comment to improve the clarity and future maintainability of this code path.

Co-authored-by: Seung Jin <[email protected]>

rohansonecha · 2025-11-20T04:48:23Z

/quicktest-core
/smoke-test -k basic
/smoke-test --postgres -k basic

rohansonecha requested review from SeungjinYang and cg505 November 18, 2025 19:24

rohansonecha self-assigned this Nov 18, 2025

wip

4859e35

rohansonecha force-pushed the async-queue-pool branch from aa1d855 to 4859e35 Compare November 19, 2025 16:29

SeungjinYang approved these changes Nov 19, 2025

View reviewed changes

rohansonecha and others added 4 commits November 19, 2025 15:54

Apply suggestion from @SeungjinYang

bb75b13

Co-authored-by: Seung Jin <[email protected]>

complete

3e0c179

clean

80af136

comment

41b4669

SeungjinYang approved these changes Nov 20, 2025

View reviewed changes

rohansonecha merged commit 918ef6d into master Nov 20, 2025
23 checks passed

rohansonecha deleted the async-queue-pool branch November 20, 2025 05:42

[DB] use queue pool for async db engine #8005

[DB] use queue pool for async db engine #8005

Conversation

rohansonecha commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rohansonecha commented Nov 18, 2025

Uh oh!

rohansonecha commented Nov 18, 2025

Uh oh!

rohansonecha commented Nov 18, 2025

Uh oh!

rohansonecha commented Nov 18, 2025

Uh oh!

rohansonecha commented Nov 18, 2025

Uh oh!

Michaelvll commented Nov 18, 2025

Uh oh!

rohansonecha commented Nov 19, 2025

Uh oh!

rohansonecha commented Nov 19, 2025

Uh oh!

rohansonecha commented Nov 19, 2025

Uh oh!

SeungjinYang left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

SeungjinYang Nov 19, 2025

Choose a reason for hiding this comment

Uh oh!

SeungjinYang Nov 20, 2025

Choose a reason for hiding this comment

Uh oh!

rohansonecha Nov 20, 2025

Choose a reason for hiding this comment

Uh oh!

rohansonecha commented Nov 20, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rohansonecha commented Nov 18, 2025 •

edited

Loading