The same knowledge pipeline yields different results on Dify 1.9.2 and Dify 2.0.0 Beta. #27516
Replies: 1 comment 4 replies
-
|
The difference in retrieval results between Dify 1.9.2 and 2.0.0 Beta—even with the same knowledge pipeline and configuration—is likely due to major changes in the knowledge pipeline and retrieval implementation introduced in 2.0.0 Beta. The 2.0.0 Beta release included a significant refactor of the knowledge pipeline, adding new modules and changing how documents are processed, chunked, indexed, and retrieved. This means that even with identical user-facing settings, backend changes (like chunking logic, embedding, or retrieval strategies) can yield different results for the same input PDF citation. Other factors that can cause discrepancies include differences in backend properties (such as indexing technique, embedding model, document segmentation, and metadata filtering), changes in default retrieval strategies, or even differences in how Celery worker queues are configured and used for document processing citation citation. To troubleshoot or align results, check that all backend settings (embedding model, chunking, indexing method), document statuses, and pipeline configurations are truly identical. If you migrated data between versions, re-indexing or re-uploading documents may be necessary to ensure consistency citation. If you need to compare retrieval logic in detail, reviewing the new rag_pipeline modules and related workflow changes in 2.0.0 Beta may help clarify where the differences arise. To reply, just mention @dosu. How did I do? Good | Irrelevant | Incorrect | Verbose | Hallucination | Report 🐛 | Other |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Self Checks
1. Is this request related to a challenge you're experiencing? Tell me about your story.
I created knowledge bases using the exact same knowledge pipeline on both Dify 1.9.2 and Dify 2.0.0 Beta, with all configuration settings identical. However, the retrieval test results for the same PDF turned out to be different. Could you please help me understand why this might be the case?
2. Additional context or comments
No response
Beta Was this translation helpful? Give feedback.
All reactions