error when loading Wikidata - Non unique primary key on DB.DBA.RDF_LANGUAGE. #1329
Comments
What is the effect of this error? Is the rest of the file ignored? Or is it just a matter of one triple not being loaded? |
You are correct. Any loading error stops parsing of the file at that point, so the rest of the file is ignored. If you are using the Virtuoso bulk loader, you can check whether any parts returned an error with the following command:
If there are any errors, you can create a new file starting from the next line of the original file and do another round of bulk loading. Note that the bulk loader skips files that have already been loaded, so you will have to use a new file name such as
Couple of questions:
|
1/ I assume that you want only
2/ I created the .ttl parts by reading the full file, storing the namespace declarations, and then splitting the file: after reading 2GB I look for the next item boundary and write that section, preceded by the namespace declarations.
3/ I am running on a Ryzen 9950X with 16 cores and 32 threads and 196GB of memory.
4/ I used 10 loaders.
5/ Here is virtuoso.ini
|
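For readers who want to reproduce the splitting described in 2/ above, here is a minimal Python sketch of that approach. It is not the script actually used in this thread. The function name `split_turtle`, the `part-NNN.ttl` naming, and the use of a blank line as the "item boundary" are all assumptions made for illustration. It also assumes that all `@prefix`/`@base` declarations appear before the data and that no multi-line string literal contains a blank line.

```python
import os


def is_declaration(line):
    # Turtle allows both "@prefix"/"@base" and SPARQL-style "PREFIX"/"BASE".
    s = line.lstrip().lower()
    return s.startswith(("@prefix", "@base", "prefix ", "base "))


def split_turtle(src_path, out_dir, max_bytes=2 * 1024**3):
    """Split a Turtle file into parts of roughly max_bytes each.

    Every part starts with the declarations collected so far. A part is
    closed at the first blank line seen after max_bytes have been written
    (assumption: blank lines separate items, as in a typical dump layout).
    Returns the list of part paths (hypothetical naming scheme).
    """
    prefixes = []   # declaration lines seen so far
    paths = []      # output files created
    out = None      # currently open part, if any
    written = 0     # bytes of data written to the current part

    with open(src_path, encoding="utf-8") as src:
        for line in src:
            if is_declaration(line):
                prefixes.append(line)
                continue
            if out is None:
                if not line.strip():
                    continue  # skip blank lines between parts
                path = os.path.join(out_dir, f"part-{len(paths) + 1:03d}.ttl")
                out = open(path, "w", encoding="utf-8")
                out.writelines(prefixes)
                paths.append(path)
                written = 0
            out.write(line)
            written += len(line.encode("utf-8"))
            # Close the part only at an item boundary (blank line).
            if written >= max_bytes and not line.strip():
                out.close()
                out = None
    if out is not None:
        out.close()
    return paths
```

Calling `split_turtle("wikidata.ttl", "split")` (both paths are placeholders) would write the parts into an existing `split` directory. Because the file is streamed line by line, memory use stays small regardless of the dump size.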
Additionally, can you run the following 2 queries and show me the output:
|
That's not going to be useful as I have since restarted the load (and this time it is working). I seem to remember that I reported this bug quite some years ago and even had a patch for it. I can't find the bug message but here is the patch (against version 7.20). Patch the Turtle loader, editing the end of
|
Thanks for your feedback. I will discuss this within development and let you know. |
When I tried to load Wikidata into the current version of Virtuoso Open Source I got the following error just after starting:
08:50:42 PL LOG: File /home/local/wikidata/split/wikidata-part-005.ttl error 23000 TURTLE RDF loader, line 112: SR197: Non unique primary key on DB.DBA.RDF_LANGUAGE.
I'm loading from a bunch of Turtle files split up from the Wikidata RDF dump.
I think I reported this error several years ago but I cannot find my original bug report. As I recollect, the problem is a timing issue that occurs when several files are being loaded at the same time.