-
Notifications
You must be signed in to change notification settings - Fork 119
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cannot index 5MB PDF with default settings using bedrock #94
Comments
Further testing shows that CHUNK_SIZE=5000 does not fully fix the issue, more testing needed, ChatGPT accepts this document but Claude says it is too big |
I tested with I was unable to to see any issue with indexing this pdf (5 MB) and querying in both with docker and with only the rag_api. If you are getting a file upload error. I would run just the rag_api (and database) and see if you are able to use the If your not having an issue with that, it may be that you have Thanks for bringing this up @dirkpetersen |
Thanks @ScarFX I set RAG_USE_FULL_CONTEXT=false but the problem persists.
It seems there is a timeout: Next is trying RAG API standalone |
I try to upload this file (5MB, 2,384,000 chars) to LibreChat with bedrock API activated
https://pve.proxmox.com/pve-docs/pve-admin-guide.pdf
I tried dev and dev-lite containers but am getting an upload error ("An Error occurred while uploading a file) in the LibreChat GUI but no real error in the logs with DEBUG_RAG_API=true, Strange
If set CHUNK_SIZE=5000 it works however, these are my RAG settings
The text was updated successfully, but these errors were encountered: