Continued: it's striking that the authors didn't even use conventional machine learning procedures to develop their classifier.

Rather, they chose features that made sense to them as indicators.

These were not even indicators of fake papers, but rather indicators of non-response to a survey, which of course is a very different thing from authorship of a fake paper.

The mind just boggles.

"Note that the tallying rule [private email, no international collaborators] identifies likely fakes, but it cannot determine with certainty whether a given publication is actually (legally) a fake. Nevertheless, it is a reliable tool to red-flag scientific reports for further analysis and is a rational basis to estimate the upper value of fake publishing in biomedicine."

RELIABLE TOOL?

RATIONAL BASIS?

And then there's this:

"It is important to keep in mind that our indicators provide a red flag, not legal proof, that a given manuscript or publication might be fake. However, it is the authors' burden of proof to demonstrate that their science can be trusted."

BULLSHIT.

It is absolutely not the authors' burden of proof to demonstrate that their science can be trusted when the criteria used to question their work are (1) their email address and (2) the lack of international collaborators.

And then there's the survey they used to "validate" their hopeless instrument.

How would you respond if you received this from some random account?

As a misinformation researcher, I get all sorts of politically motivated harassment that looks a lot like this.

Last thing I'm going to do is give them information they could look up themselves about my dean, HR people, etc.

To presume that not answering implies guilt is outrageous.

Finally, I want to stress that it is no defense whatsoever to say that the algorithm could be used simply as a preliminary screen to red-flag papers for additional scrutiny.

If one is going to propose a machine-learning classifier to make decisions that affect careers and reputations, one must carefully and thoroughly consider the issues of fairness and the risks of algorithmic harm that might arise.

The authors of this preprint don't even mention such issues.

Not only does the Science story fail to call them on this; its author also falls into one of the oldest and most pernicious traps around algorithmic bias.

The author contrasts the use of "automated methods" with reliance on "human prejudice", entirely overlooking the fact that the automated methods proposed here are nothing but the explicit and fully-described instantiation of human prejudice.

It's truly an embarrassment all around.

UPDATE: The paper is now discussed on PubPeer, and the lead author has responded.

I find his response to be a completely unsatisfactory effort at misdirection, but read it and decide for yourself.

The irony of this guy writing a paper that spectacularly overestimates the frequency of fake papers using ridiculous methods and then saying "The loss of trust in science is the key issue we should worry about."

pubpeer.com/publications/0CE23

@ct_bergstrom "I suggest that you read the books by my co-author G. Gigerenzer to learn how to interpret numbers correctly"

Think he'd get much better results if he added a condescension feature to his decision tree.


@ct_bergstrom This partially explains their thinking - Gigerenzer is a proponent of something called "fast-and-frugal trees" which appear to just be simple decision trees. Maybe there's more to it. en.wikipedia.org/wiki/Fast-and
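For readers unfamiliar with the term: a fast-and-frugal tree is just a decision tree in which cues are checked in a fixed order and at least one answer at every cue exits immediately with a decision. A minimal sketch, assuming the two cues from the preprint and an ordering of my own choosing (not the authors' actual tree):

```python
# Minimal sketch of a fast-and-frugal tree: cues checked in a fixed order,
# each cue offering an immediate exit. The cue order and exits below are
# illustrative guesses, not the preprint's actual tree.

def fft_classify(uses_private_email: bool, has_international_coauthors: bool) -> str:
    # Cue 1: institutional email address -> exit immediately as "not flagged"
    if not uses_private_email:
        return "not flagged"
    # Cue 2: international collaborators -> exit as "not flagged", otherwise flag
    if has_international_coauthors:
        return "not flagged"
    return "red-flagged"

print(fft_classify(True, False))  # "red-flagged"
```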
