One thing that I am always interested about: how many releases that my scripts thought were OK now have errors? I compared the data of smells of the data dump of December 2019 with the data dump of January 2020 and got the following chart (as always, the columns indicate the range of release numbers in the database: first column is everything with release number < 1,000,000, second column is everything between 1,000,000 and 2,000,000, and so on):
Releases in the Discogs database where an error was introduced in |
So actually: not too bad. It is also consistent with the change patterns I have seen over the years. The reasons for the peaks for older releases and newer releases:
- older releases are expanded with new information. This also means that more errors are introduced there.
- newer releases are usually added first, and expanded later by other people. This also increases the chances of errors being introduced.
Comments
Post a Comment