ADD RWKV7 #2421

pass-lin · 2025-09-28T15:11:13Z

RWKV7 is one of the strongest RNN models available today, and we now provide a full implementation for it in keras_hub.

📚 References

🔗 Pre-trained Checkpoints (ModelScope)

Numerical-verification and Inference Example notebook

This is the first modern RNN architecture in keras_hub. With the resurgence of recurrent models, more pre-trained RNN backbones will follow; hence this PR also serves as a reference implementation for future work.

Current progress

[✅] backbone implementation
[✅] checkpoint-conversion script
[✅] tokenizer implementation
[✅] unit tests / examples
[✅] complete CausalLM task wrapper
[✅] Add document

gemini-code-assist · 2025-09-28T15:11:32Z

Summary of Changes

Hello @pass-lin, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the keras_hub library by integrating the RWKV7 model, a cutting-edge recurrent neural network. This addition not only provides a robust new model for users but also serves as a foundational reference implementation, encouraging the future inclusion of more modern RNN architectures within the library.

Highlights

RWKV7 Model Integration: Introduced the RWKV7 model, a powerful RNN architecture, into keras_hub, marking a significant expansion of the library's capabilities.
Comprehensive Implementation: The pull request includes a full implementation of the RWKV7 backbone, its dedicated tokenizer, and a causal language model wrapper.
Foundation for Modern RNNs: This is the first modern RNN architecture added to keras_hub, serving as a reference implementation and paving the way for future recurrent model additions.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This PR introduces the RWKV-7 model, a powerful RNN architecture, to keras_hub. The contribution is significant and includes the backbone, tokenizer, preprocessor, an incomplete task model, and a checkpoint conversion script. The implementation follows the modular structure of keras_hub.

However, there are several critical issues that must be addressed before this PR can be merged:

Missing Tests: The PR lacks unit tests for all new components. According to the contribution guidelines, testing is a mandatory requirement.[^1]
Incomplete CausalLM Task: The RWKV7CausalLM task model is a stub with TODOs, making it non-functional for generation.
Critical Bugs: There are critical bugs in the tokenizer and preprocessor implementations that will cause runtime errors.
Style Guide Violations: There are numerous style guide violations, including a filename typo, missing docstrings, and inconsistencies with the recommended model input structure.

I've left detailed comments on these issues. Once these are resolved, this will be a great addition to the library.

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

keras_hub/src/models/rwkv7/rwkv7_casual_lm.py

gemini-code-assist · 2025-09-28T15:14:58Z

tools/checkpoint_conversion/convert_rwkv7_checkpoints.py

+from modelscope import snapshot_download
+
+from keras_hub.src.models.rwkv7.rwkv7_backbone import RWKV7Backbone
+from keras_hub.src.models.rwkv7.rwkv7_casual_lm import RWKV7CausalLM


There is a typo in the import path: rwkv7_casual_lm should be rwkv7_causal_lm. This will cause an ImportError.

Suggested change

from keras_hub.src.models.rwkv7.rwkv7_casual_lm import RWKV7CausalLM

from keras_hub.src.models.rwkv7.rwkv7_causal_lm import RWKV7CausalLM

keras_hub/src/models/rwkv7/rwkv7_causal_lm_preprocessor.py

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

gemini-code-assist · 2025-09-28T15:14:59Z

keras_hub/src/models/rwkv7/rwkv7_layer.py

+            - 0.5
+        )  # soft-clamp to (-inf, -0.5)


This hardcoded value - 0.5 is a magic number. It would be better to define it as a named constant at the top of the file or as a class attribute to improve readability and maintainability.

keras_hub/src/models/rwkv7/rwkv7_backbone.py

gemini-code-assist · 2025-09-28T15:15:00Z

keras_hub/src/models/rwkv7/rwkv7_causal_lm_preprocessor.py

+
+
+@keras_hub_export("keras_hub.models.RWKV7CausalLMPreprocessor")
+class RWKV7CausalLMPreprocessor(CausalLMPreprocessor):


The RWKV7CausalLMPreprocessor class is missing a docstring. Please add a Google-style docstring explaining its purpose, parameters, and include a usage example, as specified in the style guide.¹

Style Guide References

Footnotes

All public classes must have Google-style docstrings with a summary, examples, and documentation for parameters. ↩

keras_hub/src/models/rwkv7/rwkv7_backbone.py

pass-lin · 2025-10-19T02:28:53Z

@divyashreepathihalli @mattdangerw Anybody review my code?

pass-lin · 2025-10-20T10:17:06Z

@sachinprasadhs

gemini-code-assist

Code Review

This pull request introduces a comprehensive implementation of the RWKV-7 model, a modern RNN architecture, into keras_hub. The contribution is well-structured, following the repository's modular design with a backbone, causal LM task, preprocessor, tokenizer, and a checkpoint conversion script. The code is generally of high quality.

My review highlights a few areas for improvement. There are critical bugs in the tokenizer's asset loading and saving logic that need to be addressed. The custom layers in rwkv7_layer.py have inconsistent return signatures, which could lead to runtime errors and makes the code harder to maintain. Additionally, the test files do not use the standardized helper methods from TestCase as required by the repository's style guide, which is a significant deviation. I have also pointed out some minor typos and opportunities for code clarification.

Overall, this is an excellent and valuable addition. Addressing the feedback will improve the robustness, correctness, and maintainability of the new model.

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

gemini-code-assist · 2025-10-20T10:28:54Z

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

+        """
+        path = os.path.join(dir_path, VOCAB_FILENAME)
+        with open(path, "r", encoding="utf-8") as f:
+            vocabulary = f.readlines()


The load_assets method uses f.readlines(), which keeps trailing newline characters (\n). This will cause the int() conversion to fail when parsing the vocabulary file because int() cannot parse a string with a trailing newline (e.g., int('3\n')). You should strip the newlines when reading the file.

Suggested change

vocabulary = f.readlines()

vocabulary = f.read().splitlines()

keras_hub/src/models/rwkv7/rwkv7_layer.py

gemini-code-assist · 2025-10-20T10:28:54Z

keras_hub/src/models/rwkv7/rwkv7_layer.py

+        if train_mode:
+            return output
+        return output, last_cache_x


The call method in RWKV7_ChannelMix has an inconsistent return signature. It returns one value in training mode and two in inference mode. This can lead to errors and makes the layer harder to use. The method should always return the same number of values, using None for outputs that are not computed in a particular mode.¹

Suggested change

if train_mode:

return output

return output, last_cache_x

if train_mode:

return output, None

return output, last_cache_x

Rules References

Footnotes

Methods should have a consistent return signature to avoid errors and improve predictability. Returning None for values not computed in a certain mode is a good practice. ↩

The method in has an inconsistent return signature. It returns one value in training mode and two in inference mode. This can lead to errors and makes the layer harder to use. The method should always return the same number of values, using for outputs that are not computed in a particular mode.call``RWKV7_ChannelMix``None1

Rules References

Footnotes

Methods should have a consistent return signature to avoid errors and improve predictability. Returning for values not computed in a certain mode is a good practice. ↩None

this is error issue

keras_hub/src/models/rwkv7/rwkv7_layer.py

keras_hub/src/models/rwkv7/rwkv7_backbone.py

keras_hub/src/models/rwkv7/rwkv7_causal_lm.py

tools/checkpoint_conversion/convert_rwkv7_checkpoints.py

pass-lin · 2025-10-29T07:39:28Z

@mattdangerw @divyashreepathihalli @sachinprasadhs
It's been almost a month, is there anyone willing to pay attention to me?

sachinprasadhs · 2025-11-04T21:13:41Z

Apologies for the delay in review, taking a look into this. Will add my comments.

sachinprasadhs · 2025-11-04T21:13:49Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces the RWKV-7 model, a modern RNN architecture, to keras_hub. The implementation is comprehensive, covering the backbone, causal LM task, preprocessor, tokenizer, and a checkpoint conversion script. The code is well-structured and follows the modular design principles of the repository.

My review focuses on ensuring adherence to the repository's style guide, particularly regarding testing practices and code style conventions. I've identified several areas for improvement:

The testing for the new components should be updated to use the standardized helper methods from the base TestCase. Some tests also contain incorrect assertions.
There are a few deviations from the coding style, such as the use of type hints in function signatures and a few hardcoded values that could be made more flexible.
The backbone implementation should be updated to accept a padding_mask as input, aligning with the repository's conventions.

Addressing these points will improve the consistency, correctness, and maintainability of the new model. Overall, this is a great contribution, adding a powerful and interesting architecture to the library.

keras_hub/src/models/rwkv7/rwkv7_backbone.py

keras_hub/src/models/rwkv7/rwkv7_backbone_test.py

gemini-code-assist · 2025-11-04T21:19:44Z

keras_hub/src/models/rwkv7/rwkv7_causal_lm_preprocessor_test.py

+    def test_preprocessor_basics(self):
+        result = self.preprocessor(x=["hello world hello world hello world"])
+        self.assertAllEqual(
+            result[0], [[0, 0, 0, 0, 0, 0, 4, 1, 5, 1, 4, 1, 5, 1, 4, 1]]
+        )
+        self.assertAllEqual(
+            result[1], [[0, 0, 0, 0, 0, 4, 1, 5, 1, 4, 1, 5, 1, 4, 1, 5]]
+        )
+        self.assertAllEqual(
+            result[2],
+            [
+                [
+                    False,
+                    False,
+                    False,
+                    False,
+                    False,
+                    True,
+                    True,
+                    True,
+                    True,
+                    True,
+                    True,
+                    True,
+                    True,
+                    True,
+                    True,
+                    True,
+                ]
+            ],
+        )


There are a couple of issues here:

The test assertions for the output shapes appear to be incorrect. For a sequence_length of 15, the call method adjusts it to 17. The resulting sample_weight (result[2]) should have a length of 16, but the test asserts a length of 17.

The style guide recommends using the self.run_preprocessor_test() helper for testing preprocessors, which is not used here.¹

Please correct the assertions and refactor the test to use the standard test helper.

Style Guide References

Footnotes

The style guide requires using self.run_preprocessor_test() for testing preprocessors. ↩

The Gemini number is incorrect

gemini-code-assist · 2025-11-04T21:19:44Z

keras_hub/src/models/rwkv7/rwkv7_causal_lm_test.py

+    def test_generate(self):
+        """
+        Test text generation functionality.
+        """
+        causal_lm = RWKV7CausalLM(self.backbone, self.preprocessor)
+        prompt = ["hello world"]
+        output = causal_lm.generate(prompt, 16)
+        self.assertTrue(isinstance(output[0], str))
+        self.assertTrue(isinstance(output, list))
+
+        prompt = "hello world"
+        output = causal_lm.generate(prompt, 16)
+        self.assertTrue(isinstance(output, str))
+
+    def test_generate_strip_prompt(self):
+        """
+        Test that generated text can strip the prompt from output.
+        """
+        prompt = ["hello world"]
+        causal_lm = RWKV7CausalLM(self.backbone, self.preprocessor)
+        output = causal_lm.generate(prompt, 16, strip_prompt=True)
+        self.assertFalse(output[0].startswith(prompt[0]))
+
+    def test_generate_compilation(self):
+        """
+        Test that the generate function compiles correctly and
+        reuses compiled functions.
+        """
+        causal_lm = RWKV7CausalLM(self.backbone, self.preprocessor)
+        causal_lm.generate(["hello world"], 16)
+        first_fn = causal_lm.generate_function
+        causal_lm.generate(["hello world"], 16)
+        second_fn = causal_lm.generate_function
+        self.assertEqual(first_fn, second_fn)
+
+        causal_lm.compile(sampler="greedy")
+        self.assertIsNone(causal_lm.generate_function)


According to the style guide, task model tests should use the self.run_task_test() helper method.¹ This ensures comprehensive testing, including the attached preprocessor functionality. Please refactor the tests to use this helper.

Suggested change

def test_generate(self):

"""

Test text generation functionality.

"""

causal_lm = RWKV7CausalLM(self.backbone, self.preprocessor)

prompt = ["hello world"]

output = causal_lm.generate(prompt, 16)

self.assertTrue(isinstance(output[0], str))

self.assertTrue(isinstance(output, list))

prompt = "hello world"

output = causal_lm.generate(prompt, 16)

self.assertTrue(isinstance(output, str))

def test_generate_strip_prompt(self):

"""

Test that generated text can strip the prompt from output.

"""

prompt = ["hello world"]

causal_lm = RWKV7CausalLM(self.backbone, self.preprocessor)

output = causal_lm.generate(prompt, 16, strip_prompt=True)

self.assertFalse(output[0].startswith(prompt[0]))

def test_generate_compilation(self):

"""

Test that the generate function compiles correctly and

reuses compiled functions.

"""

causal_lm = RWKV7CausalLM(self.backbone, self.preprocessor)

causal_lm.generate(["hello world"], 16)

first_fn = causal_lm.generate_function

causal_lm.generate(["hello world"], 16)

second_fn = causal_lm.generate_function

self.assertEqual(first_fn, second_fn)

causal_lm.compile(sampler="greedy")

self.assertIsNone(causal_lm.generate_function)

def test_task_basics(self):

"""

Test CausalLM task with the preprocessor.

"""

self.run_task_test(

cls=RWKV7CausalLM,

init_kwargs=self.init_kwargs,

train_data=(

["hello world", "the quick brown fox"],

),

expected_output_shape=(2, 16, 5),

)

Style Guide References

Footnotes

The style guide requires using self.run_task_test() for testing task models with preprocessors. ↩

These are the tests modified based on LLaMA Casual LM. I think Gemini's review is incorrect.

gemini-code-assist · 2025-11-04T21:19:44Z

keras_hub/src/models/rwkv7/rwkv7_causal_lm.py

+    Examples:
+    ```python
+    # Initialize the tokenizer and load assets from a local path.
+    tokenizer = RWKVTokenizer()
+    tokenizer.load_assets(rwkv_path)
+
+    # Create a preprocessor with a sequence length of 8.
+    preprocessor = RWKV7CausalLMPreprocessor(tokenizer, sequence_length=8)
+
+    # Initialize the model with a backbone and preprocessor.
+    causal_lm = RWKV7CausalLM(backbone, preprocessor)
+
+    prompts = ["Bubble sort\n```python", "Hello World\n```python\n"]
+
+    causal_lm.compile(sampler="greedy")
+
+    outputs = causal_lm.generate(prompts, max_length=128)
+    for out in outputs:
+        print(out)
+        print("-" * 100)
+    ```


The example in the docstring is not self-contained and cannot be run as-is, as it references undefined variables like RWKVTokenizer, rwkv_path, and backbone.¹ Per the style guide, examples should be comprehensive and runnable. Please update it to be a complete code snippet.

Style Guide References

Footnotes

Docstrings must include comprehensive examples showing usage patterns. ↩

keras_hub/src/models/rwkv7/rwkv7_layer.py

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

pass-lin · 2025-11-06T10:33:02Z

@sachinprasadhs I have updated the code based on the review.

sachinprasadhs

Added few more comments and there are few Gemini review not addressed, please address.

keras_hub/src/models/rwkv7/rwkv7_backbone_test.py

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

sachinprasadhs · 2025-11-07T19:39:18Z

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

+        return ret
+
+
+class RWKV_TOKENIZER:


Rename it to RWKWTokenizer to match our naming standard.

Rename it to RWKWTokenizer to match our naming standard.

In Keras Hub, "RWKV_TOKENIZER" is an internal, private class. Since this is the original implementation by the native author, we don't want to make too many changes to avoid introducing bugs. Our official API will be called "RWKWTokenizer".

@keras_hub_export("keras_hub.tokenizers.RWKVTokenizer") class RWKVTokenizer(tokenizer.Tokenizer):

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

sachinprasadhs · 2025-11-07T21:55:22Z

keras_hub/src/models/rwkv7/rwkv7_tokenizer_test.py

+    def setUp(self):
+        self.tokenizer = RWKVTokenizer(
+            ["1 ' ' 1", "2 '\\n' 1", "3 'the' 3", "4 'hello' 5", "5 'world' 5"]
+        )
+
+    def test_tokenizer_basics(self):
+        result = self.tokenizer("hello world")
+        self.assertAllEqual(result, [4, 1, 5])
+
+    def test_vocabulary_size(self):
+        self.assertEqual(self.tokenizer.vocabulary_size(), 5)
+
+    def test_tokenize_and_detokenize(self):
+        # Test detokenization
+        text = self.tokenizer.detokenize([[4, 1, 5]])
+        self.assertEqual(text[0], "hello world")
+
+    def test_special_tokens(self):
+        self.assertEqual(self.tokenizer.pad_token_id, 0)
+        self.assertEqual(self.tokenizer.end_token_id, 2)


This whole setup needs to follow our standard testing procedure, to run run_preprocessing_layer_test add run_preset_test and all presets tests which you can skip it for now. later we can enable once presets are uploaded.

keras_hub/src/models/rwkv7/rwkv7_layer.py

sachinprasadhs · 2025-11-07T23:19:46Z

keras_hub/src/models/rwkv7/rwkv7_layer.py

+class GroupNorm(keras.layers.GroupNormalization):
+    """Group normalization with backend-specific handling.
+
+    Extends Keras GroupNormalization with PyTorch backend support.
+    """


What is the issue with the default group norm for pytorch backend?

What is the issue with the default group norm for pytorch backend?

The PyTorch backend runs faster than the Keras implementation and aligns better with the original model, so I believe it should be kept.

pass-lin · 2025-11-09T08:43:12Z

@sachinprasadhs I think I have already revised my code according to the review. I have responded to all the parts that were not corrected.

sachinprasadhs · 2025-11-10T23:45:29Z

Hi, Thanks for showing interest in porting the models into Keras ecosystem. We appreciate you taking the initiative to contribute.

We took a look at the model's Hugging Face page and noticed that its usage seems to be quite low at the moment. Since every model we add requires a long-term commitment to maintenance and testing, we've decided not to add this specific model to the official library at this time.

We definitely think you should keep the implementation available in your own repo, it’s a great way to share your work with those who need it!
The best way to contribute is to propose a new model in a GitHub issue or to pick up one of the existing issues where we're looking for help.

pass-lin · 2025-11-11T03:00:34Z

Hi, Thanks for showing interest in porting the models into Keras ecosystem. We appreciate you taking the initiative to contribute.

We took a look at the model's Hugging Face page and noticed that its usage seems to be quite low at the moment. Since every model we add requires a long-term commitment to maintenance and testing, we've decided not to add this specific model to the official library at this time.

We definitely think you should keep the implementation available in your own repo, it’s a great way to share your work with those who need it! The best way to contribute is to propose a new model in a GitHub issue or to pick up one of the existing issues where we're looking for help.

Hello, may I sincerely recommend the RWKV7 model to you? It is the most powerful pure RNN (Recurrent Neural Network) large language model in modern times and also the largest parameterized pure RNN LM. This is a series of language models, and its original repository has 14K stars. You can see that there is currently a lot of work being built around RWKV at this link: https://www.rwkv.com/.

Compared to transformer models, RNN models, due to their lower computational requirements and constant-level cache, are more suitable for deployment on mobile and other edge devices than transformer-based models. However, it seems that keras-hub currently lacks this category of pure RNN models, and RWKV7 appears to be a perfect candidate for the first one. It is powerful in its own right, and my implementation of RWKV7 also serves as a reference for future RNN LLM contributors.

Additionally, you may notice that even on Hugging Face, the model BlinkDL/rwkv7-g1 has 135 likes, a number comparable to DepthAnythingV2. This indicates that RWKV7 has a dedicated group of loyal users.

I would be happy to assist the keras-team with long-term maintenance to ensure that RWKV7 works smoothly on the future keras-hub. I hope you will reconsider adding RWKV7 to keras-hub.

sachinprasadhs · 2025-11-11T04:09:27Z

We are observing the page here https://huggingface.co/BlinkDL/models, for any of the variations there is no last 30 days download numbers.

pass-lin · 2025-11-11T04:37:22Z

We are observing the page here https://huggingface.co/BlinkDL/models, for any of the variations there is no last 30 days download numbers.

You can see on their G1 model homepage (https://huggingface.co/BlinkDL/rwkv7-g1/tree/main) that they updated the latest 13B model just ten days ago. It is clearly unreasonable that there have been no downloads in the past month.

The RWKV community is a very active one, with the authors constantly making updates. However, their style is quite different from the mainstream, as they prefer to save their models in the same repository. This is a disadvantage when it comes to download statistics, referring to Hugging Face's download rules, which mainly count files such as config.json, config.yaml, hyperparams.yaml, params.json, and meta.yaml. In contrast, ModelScope counts the downloads of the entire repository (downloading a single file does not increase the download count). Since RWKV often only requires downloading a single weight .pth file, you would observe extremely low download counts.

Considering that this is an RNN model produced by a Chinese company, with its main users being Chinese, the primary method of use is to directly download the .pth weights from https://www.rwkv.cn/. Then, when calling the model, users rely on RWKV's own ecosystem, such as:

These ecosystems all have a high number of stars, which shows that RWKV has a significant user base. Users can also directly download the weights within RWKV-Runner for use.

These ecosystems are user-friendly and easy to use for both individual and industrial users, but they are not very friendly for researchers. The keras-hub version of RWKV7 can address this issue, allowing researchers to more flexibly train, improve, and utilize modern RNN large models based on their needs.

sachinprasadhs · 2025-11-11T05:09:27Z

Thanks for providing the details, will again circle back with the team and get back to you on this one.

pass-lin · 2025-11-11T06:21:39Z

Thanks for providing the details, will again circle back with the team and get back to you on this one.

Finally, please allow me to add one more point.

From February 26, 2025, to today, RWKV-LM has gained 1,000 stars (from 13.1k to 14.1k). It has also increased by 100 stars from October 16 to today. You can see this trend of star growth at the following link.

I believe this demonstrates that RWKV is a very popular and highly active community.

tempdragon · 2025-11-11T10:09:29Z

BTW, RWKV is also mentioned here by the Linux Foundation.
https://lfaidata.foundation/projects/rwkv/

pass-lin added 3 commits September 28, 2025 22:48

add RWKV

195ef79

fix

7bc36b5

fix

7d4a7a1

gemini-code-assist bot reviewed Sep 28, 2025

View reviewed changes

pass-lin added 7 commits October 7, 2025 23:15

add inference

e5bb446

add inference

afcff31

add tokenizer doc

ec0baf3

add doc

bd6c618

add test case

4201a7f

fix test

897a64b

fix doc

ff11f94

divyashreepathihalli requested a review from sachinprasadhs October 19, 2025 18:38

gemini-code-assist bot reviewed Oct 20, 2025

View reviewed changes

pass-lin added 3 commits October 20, 2025 18:44

fix gemini review.

ce13d54

format.

0e36b4a

format.

7218888

pass-lin added 5 commits October 29, 2025 16:02

save tokenizer

cc5815b

fix tokenizer load

dd80464

fix save

5e8723d

renew preset

f223002

renew perset.

b2b1573

pass-lin force-pushed the rwkv branch from c2afdde to b2b1573 Compare November 3, 2025 09:11

debug for remat

c5ebeec

gemini-code-assist bot reviewed Nov 4, 2025

View reviewed changes

pass-lin added 6 commits November 6, 2025 12:18

modify

7f8bda7

modify

00200a8

modify

e97b458

modify

75a4415

modify

8c3638b

modify rwkv casual lm.

468dce1

sachinprasadhs reviewed Nov 7, 2025

View reviewed changes

modify tokenizer

637fdcb

pass-lin added 9 commits November 9, 2025 18:20

fix test bug

24e67ec

fix test bug

4eb4845

fix test bug

be4a649

fix test bug

28700d9

fix test bug

2e2d5c0

fix test bug

97b39cf

fix test bug

44e6476

fix test bug

b7ed34b

fix test bug

b3e33fd

kmk142789 approved these changes Nov 10, 2025

View reviewed changes

pass-lin added 3 commits November 13, 2025 13:46

modify RWKV7CausalLMPreprocessor

75c8a88

modify RWKV7CausalLMPreprocessor

eac1505

modify RWKV7CausalLMPreprocessor

06ec6c5

	from keras_hub.src.models.rwkv7.rwkv7_casual_lm import RWKV7CausalLM
	from keras_hub.src.models.rwkv7.rwkv7_causal_lm import RWKV7CausalLM



		@keras_hub_export("keras_hub.models.RWKV7CausalLMPreprocessor")
		class RWKV7CausalLMPreprocessor(CausalLMPreprocessor):

	vocabulary = f.readlines()
	vocabulary = f.read().splitlines()

ADD RWKV7 #2421

Are you sure you want to change the base?

ADD RWKV7 #2421

Conversation

pass-lin commented Sep 28, 2025 • edited by sachinprasadhs Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📚 References

🔗 Pre-trained Checkpoints (ModelScope)

Uh oh!

gemini-code-assist bot commented Sep 28, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist bot Sep 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist bot Sep 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gemini-code-assist bot Sep 28, 2025

Choose a reason for hiding this comment

Style Guide References

Footnotes

Uh oh!

Uh oh!

pass-lin commented Oct 19, 2025

Uh oh!

pass-lin commented Oct 20, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

gemini-code-assist bot Oct 20, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gemini-code-assist bot Oct 20, 2025

Choose a reason for hiding this comment

Rules References

Footnotes

Uh oh!

pass-lin Oct 20, 2025

Choose a reason for hiding this comment

Rules References

Footnotes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pass-lin commented Oct 29, 2025

Uh oh!

sachinprasadhs commented Nov 4, 2025

Uh oh!

sachinprasadhs commented Nov 4, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist bot Nov 4, 2025

Choose a reason for hiding this comment

Style Guide References

pass-lin commented Sep 28, 2025 •

edited by sachinprasadhs

Loading

pass-lin Nov 8, 2025 •

edited

Loading

pass-lin commented Nov 11, 2025 •

edited

Loading