When I decided to jump into the fediverse with both feet, I signed up for a BookWyrm account at @bookrastinating. When I tried to import my history/library, there were two big issues.

The first was timing. I joined and started an import at about the same time many, many, many other people did, and the server promptly fell over. The owner of Bookrastinating was helpful and friendly and eventually the queue started moving again, so that’s not an issue anymore.

The second was not actually a problem with BookWyrm at all, but with GoodReads! It turns out that a bunch of books I’d previously tagged as read on GoodRead were no longer the books they had been. No offense to Marc Blake, author of “How to Be a Sitcom Writer: Secrets from the Inside,” but I’ve never read that book. When I “shelved” it on January 9, 2017, I assigned it “baroque, cycle, fiction, hardback, series, read,” so it seems very clear that what I actually shelved was a book by Neal Stephenson, and checking now shows that GoodReads only knows I’ve read the second book in the tagged “baroque cycle” series, but it has lost the first and third.

I’m not sure when the GoodReads database was corrupted, and in a review of my Reading Challenge book lists for 2022, 2021, 2020, 2019, 2018, and 2017, everything seems probably-correct. But my 2017 Reading Challenge also does not include the book I mentioned above, the book that caught my eye as clearly-incorrect.

When I look at books shelved in 2017, I see many that are definitely not right, although they don’t have “read” dates. I’m sure After the Martian Apocalypse: Extraterrestrial Artifacts and the Case for Mars Exploration is a perfectly fine book, but it definitely wasn’t what I tagged with “fiction paperback series dragon king trilogy” on January 9, 2017. Love on the Dotted Line could be a fantastic romance novel, but it’s not what I tagged “paperback anthology science fiction” on January 9, 2017. A dual-language collection of Italian women’s poetry is definitely not what I tagged as a volume of the “Writers of the Future” science fiction anthology series. And so on. The ones I notice most easily are the titles I would never read, but the date January 9, 2017, stands out. Sorting by date added, I can see that some of the books added on that date seem correct. I know the books, and the tags match the books. Most do not.

I’m sure there’s some irony in an import failure on the fediverse alerting me to serious corruption on Amazon-owned GoodReads, and the result stopping me from actually migrating.

I clearly cannot trust GoodReads, as they’ve broken the first and second rules of a database: they’ve lost data, and represented data falsely to belong to me when it doesn’t. I’m not sure which of those is the first rule and which is the second, but both seem bad.

I also cannot import my entire library from GoodReads into BookWyrm, because I don’t want to start with bad data. I think it’s time to let most of the past go, and create a cleaned-up import file with just my reading history from 2017 onward.

Good thing I have the day off tomorrow!

As I continue to explore GoodReads with an eye to exiting that site, I am amazed at just how terrible it is. I exported my library, intending to delete everything that wasn’t read in the last five years, but the resulting CSV file is missing so much!

There is no “Date Read” for many books that are only present in the file because I read them, and a surprising number of books are missing ISBNs, too. I’m talking about books in print, not ebooks or audiobooks that might not even have ISBNs.

It is clear that Amazon does not prioritize accuracy in export files, and as I explain previously, they don’t prioritize accuracy in their own database either, although the effect is seen in different ways.

The long and short of it is that I’m apparently going to need to manually edit the CSV file in order to get anything like accurate data, and giving up the convenient GoodRead mobile app for a BookWyrm server seems more than ever like a fair trade. I’ll happily pitch a few bucks a month toward @bookrastinating for their help getting things set up, and to avoid propping up Amazon’s less-than-lackluster efforts with GoodReads.

This seems like a good way to spend a day off work, right?

Show thread

I’m not sure which is worse, exactly: GoodReads exporting my library with no “Date Read” filled in, despite having a “Date Read” on the site, or GoodReads exporting my library with the wrong “Date Read” filled in, one that is a day off from the one on the site.

How does that even happen? Given that it’s off by a day when it’s off, I’m guess it’s a time zone issue, with either the site or the export function using GMT and the other using my local time, six hours removed.

That doesn’t explain the many rows in which the date is just missing. Nor the missing ISBNs.

It’s been clear for a long time that Amazon doesn’t actually care about making GoodReads a GoodSite, but this is really, really bad.

What I’ve settled on doing is opening the export CSV in a spreadsheet in one pane, sorted by reverse “Date Added,” and looking at “My Books” on GoodReads in another pane, sorted the same way.

The Arc browser from arc.net/ is working pretty well for this.

Show thread
Follow

The CSV spreadsheet is usually the one that’s a day ahead, so I guessed it was the one using GMT. Except… every now and then, it’s the one that’s a day behind. The inconsistency rules out GMT shenanigans, I think, but also, I don’t ever supply a time when marking a book as read. I select a year, month, and day. Which suggests it’s arbitrarily storing a time, and the fact that it’s off so often suggests it’s storing a time within five or six hours of midnight. And apparently inconsistently.

Another amusing bug anecdote is that books I finished reading on Oct 1, 2022, are just listed on the site as having been read “Oct 2022.” No date shown. The CSV spreadsheet has a date, of course.

It seems like there’s an entire class of date/time bugs on GoodReads, before we even get to the missing ISBNs and badly-corrupt data from 2017.

Sign in to participate in the conversation
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.