Today at , I will be presenting our work on the evaluation of the historical adequacy of masked language models (MLMs) for . There are several such models, and they represent the current state of the art for a number of downstream tasks, such as semantic change and text reuse detection. However, a historical researcher, philologist, or other scholar would want to be sure that such models really represent the historical period of interest. For example, it would be an embarrassing hallucination if St. Augustine showed up in the context of the Roman senate.

Our evaluation confirms a known problem: LLMs, and masked models in particular, are trained on corpora without attention to historical periods. Unlike in our earlier research on Early Modern English, this problem makes the models barely distinguishable in their ability to generate text appropriate to a given historical period. Even though history is a case where it is most obvious when models go wrong, this type of contamination is a known problem for LLM training overall: think of different legal jurisdictions using the same language, dialects in programming languages, etc.

This research was generously supported by AgileLab.

The full paper is available at:
anthology.ach.org/volumes/vol0

I would rather define myself as a sceptic (I've documented the reasons in my publications at zotero.org/mapto/publications), but here's what a professional adopter, i.e. a founder of a GenAI startup (whom I value a lot), has to say about the state of the technology. Needless to say, we diverge in our outlooks, but that's not the point here.

"In a broader sense, however, today’s ruling is of a piece with this Court’s recent tendencies. “[R]ight when the Judiciary should be hunkering down to do all it can to preserve the law’s constraints,” the Court opts instead to make vindicating the rule of law and preventing manifestly injurious Government action as difficult as possible….. This is Calvinball jurisprudence with a twist. Calvinball has only one rule: There are no fixed rules. We seem to have two: that one, and this Administration always wins."

techdirt.com/2025/08/22/justic

History keeps repeating, dictators keep pulling each other's strings.

"In interviews for a book about his Middle East peace efforts, Trump, according to its author, used an expletive to describe the embattled prime minister — “Fuck him,” he reportedly said — and accused Netanyahu of disloyalty."

timesofisrael.com/trump-posts-

This is how ruthless the Israeli military complex is.

The sign talks of "protecting the state of Israel", remaining silent on the fact that this offensive "protection" actually costs hundreds of thousands of lives in Israel and the region.

Context:
theguardian.com/science/2025/j

It doesn't take much to create an alternative to . We just need structured profiles (featuring experience and education) and posts (that let people know whether they can still apply or the position is taken).
