feat: Make pagination object available in more scopes of REST stream class #3245

edgarrmondragon · 2025-08-20T22:07:18Z

Summary

Resolves #1606

This PR makes the pagination object available as an instance attribute (self.paginator) across all method scopes within the REST stream class, not just within request_records. This eliminates the need to duplicate pagination configuration values across different methods.

Key Changes

Added pagination instance attribute - _current_paginator with proper lifecycle management
Implemented context manager pattern - Clean creation and cleanup of paginator per request cycle
Enhanced method documentation - Updated get_url_params with usage examples
Comprehensive test coverage - 4 new test cases covering all aspects of the feature

Benefits

Reduces code duplication - Access pagination config from any method during request processing
Maintains backward compatibility - All existing implementations continue to work unchanged
Fresh paginator per context - Each request_records call gets a new paginator instance
Improved developer experience - Clear error messages when accessing paginator incorrectly

Usage Example

Before (with duplication):

class MyStream(RESTStream):
    PAGE_SIZE = 25

    def get_new_paginator(self):
        return BaseOffsetPaginator(start_value=0, page_size=self.PAGE_SIZE)

    def get_url_params(self, context, next_page_token):
        return {"limit": self.PAGE_SIZE}  # Duplicated constant

After (no duplication):

class MyStream(RESTStream):
    def get_new_paginator(self):
        return BaseOffsetPaginator(start_value=0, page_size=25)

    def get_url_params(self, context, next_page_token):
        # Access paginator configuration directly
        try:
            limit = self.paginator.page_size  # If accessible
        except AttributeError:
            limit = 25  # fallback
        return {"limit": limit}

Testing

All existing tests continue to pass
New comprehensive test suite validates:
- Paginator accessibility across multiple methods
- Fresh paginator creation per request
- Proper lifecycle cleanup
- Backward compatibility
- Error handling when accessed incorrectly

🤖 Generated with Claude Code

Summary by Sourcery

Make the pagination object available across all REST stream methods by introducing a context-managed paginator property while preserving existing behaviors

New Features:

Expose self.paginator property during request_records lifecycle
Introduce _paginator_context context manager to handle paginator creation and cleanup

Enhancements:

Refactor request_records to use the paginator context manager
Update get_url_params documentation to reference the new paginator property

Documentation:

Enhance RESTStream docstrings to describe self.paginator availability and usage

Tests:

Add tests for paginator accessibility in multiple methods
Add tests for fresh paginator creation per request_records call
Add tests for paginator lifecycle cleanup
Add tests for fallback when get_new_paginator returns None
Add tests for backward compatibility with legacy pagination patterns

… stream class

sourcery-ai · 2025-08-20T22:07:30Z

Reviewer's Guide

This PR refactors the RESTStream pagination implementation by introducing a private _current_paginator attribute and a _paginator_context manager to expose a fresh paginator instance (self.paginator) across all method scopes, replaces the inline paginator logic in request_records with this context manager, enhances documentation for pagination, and adds comprehensive tests covering paginator accessibility, lifecycle, freshness, fallback behavior, and backward compatibility.

Sequence diagram for paginator lifecycle in request_records

sequenceDiagram
    participant RESTStream
    participant Paginator
    participant Metrics
    participant HTTPRequest
    RESTStream->>RESTStream: enter _paginator_context()
    RESTStream->>Paginator: get_new_paginator()
    RESTStream->>RESTStream: set self._current_paginator
    RESTStream->>Metrics: start http_request_counter
    loop while not paginator.finished
        RESTStream->>HTTPRequest: prepare_request(context, next_page_token)
        RESTStream->>HTTPRequest: decorated_request(prepared_request, context)
        RESTStream->>Metrics: increment counter
        RESTStream->>RESTStream: update_sync_costs
        RESTStream->>RESTStream: parse_response(resp)
        RESTStream->>Paginator: advance(resp)
    end
    RESTStream->>RESTStream: exit _paginator_context(), cleanup self._current_paginator

Class diagram for updated RESTStream pagination lifecycle

classDiagram
    class RESTStream {
        - _current_paginator: BaseAPIPaginator | None
        + paginator: BaseAPIPaginator
        + _paginator_context(): Iterator[BaseAPIPaginator]
        + request_records(context: Context | None): Iterable[dict]
        + get_new_paginator()
        + get_url_params(context, next_page_token)
    }
    RESTStream --> BaseAPIPaginator
    RESTStream --> SinglePagePaginator
    RESTStream : uses _paginator_context for paginator lifecycle
    RESTStream : exposes paginator property during request_records
    RESTStream : get_url_params can access self.paginator
    class BaseAPIPaginator {
        + current_value
        + finished
        + advance(resp)
        + continue_if_empty(resp)
        + page_size
    }
    class SinglePagePaginator {
        + finished
    }
    BaseAPIPaginator <|-- SinglePagePaginator

Class diagram for paginator property and context manager

classDiagram
    class RESTStream {
        + paginator: BaseAPIPaginator
        + _paginator_context(): Iterator[BaseAPIPaginator]
    }
    RESTStream --> BaseAPIPaginator
    RESTStream : paginator only available during request_records
    RESTStream : _paginator_context manages lifecycle

File-Level Changes

Change	Details	Files
Introduce paginator attribute and lifecycle context manager	Initialize `_current_paginator` in RESTStream constructor Add `paginator` property with runtime check for active context Implement `_paginator_context` context manager for setup/cleanup Wrap `request_records` logic in `_paginator_context` and move paginator advancement	`singer_sdk/streams/rest.py`
Update pagination-related documentation	Expand `paginator` property docstring to describe usage and errors Enhance `get_url_params` doc with paginator availability note	`singer_sdk/streams/rest.py`
Add comprehensive pagination tests	Test paginator access within various methods and error outside context Verify fresh paginator creation per `request_records` call Ensure paginator cleanup after processing Check fallback to SinglePagePaginator when none provided Validate backward compatibility of legacy vs. modern usage	`tests/core/rest/test_pagination.py`

Assessment against linked issues

Issue	Objective	Addressed
#1606	Make the pagination object available as an instance attribute (e.g., self.paginator) in more scopes of the REST stream class, not just within request_records.	✅
#1606	Ensure that the paginator instance is accessible from other methods such as get_url_params, allowing access to pagination configuration without duplicating logic.	✅
#1606	Maintain proper lifecycle management so that the paginator is only available during active stream processing and is cleaned up after use.	✅

Tips and commands

Interacting with Sourcery

Trigger a new review: Comment @sourcery-ai review on the pull request.
Continue discussions: Reply directly to Sourcery's review comments.
Generate a GitHub issue from a review comment: Ask Sourcery to create an
issue from a review comment by replying to it. You can also reply to a
review comment with @sourcery-ai issue to create an issue from it.
Generate a pull request title: Write @sourcery-ai anywhere in the pull
request title to generate a title at any time. You can also comment
@sourcery-ai title on the pull request to (re-)generate the title at any time.
Generate a pull request summary: Write @sourcery-ai summary anywhere in
the pull request body to generate a PR summary at any time exactly where you
want it. You can also comment @sourcery-ai summary on the pull request to
(re-)generate the summary at any time.
Generate reviewer's guide: Comment @sourcery-ai guide on the pull
request to (re-)generate the reviewer's guide at any time.
Resolve all Sourcery comments: Comment @sourcery-ai resolve on the
pull request to resolve all Sourcery comments. Useful if you've already
addressed all the comments and don't want to see them anymore.
Dismiss all Sourcery reviews: Comment @sourcery-ai dismiss on the pull
request to dismiss all existing Sourcery reviews. Especially useful if you
want to start fresh with a new review - don't forget to comment
@sourcery-ai review to trigger a new review!

Customizing Your Experience

Access your dashboard to:

Enable or disable review features such as the Sourcery-generated pull request
summary, the reviewer's guide, and others.
Change the review language.
Add, remove or edit custom review instructions.
Adjust other review settings.

Getting Help

Contact our support team for questions or feedback.
Visit our documentation for detailed guides and information.
Keep in touch with the Sourcery team by following us on X/Twitter, LinkedIn or GitHub.

sourcery-ai

Hey there - I've reviewed your changes - here's some feedback:

Add a test for early generator close to ensure _current_paginator always resets even when iteration stops prematurely
Consider isolating paginator state using thread-local or stack-based storage to support concurrent or nested request_records calls without state collisions

Prompt for AI Agents

Please address the comments from this code review:
## Overall Comments
- Add a test for early generator close to ensure `_current_paginator` always resets even when iteration stops prematurely
- Consider isolating paginator state using thread-local or stack-based storage to support concurrent or nested `request_records` calls without state collisions

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

sourcery-ai · 2025-08-20T22:08:27Z