The designers and bosses of AI like to repeat that if their products are racist and biased, it is because the users and click workers who produce the training data are racist and biased. Meta's Galactica shows otherwise. Trained on a corpus of high-quality sources, the model produced results so bad that it had to be shut down. https://arstechnica.com/information-technology/2022/11/after-controversy-meta-pulls-demo-of-ai-model-that-writes-scientific-papers/
[Reposting this since the Mastodon threading messed up and it got lost in the replies]
There is a #Latex package that makes it easier to use #AnnotatedEquations! Also, it is much more visually appealing since the heights of the various boxes are normalized.
Check it out! Thanks to Twitter user @scien_ti_st for doing all the hard work to create this.
Link to original GitHub Repo:
https://github.com/synercys/annotated_latex_equations
Link to latex CTAN package:
https://ctan.org/pkg/annotate-equation
#Zettlr is an open source “#Markdown editor for the 21st century” that comes with #pandoc included. It works on all major platforms, plays well with reference managers, supports the “#zettelkasten” system for note taking, and more.
I wrote a piece for the New York Times about how scientists used Twitter during the Covid pandemic and about what comes next.
Babbage Patch Finds--a daily dive into CBI's vast computing archives
UNIVAC 1103 SCIENTIFIC
The US computer industry began with two firms, both founded in 1946: Engineering Research Assoc. (ERA), former Naval cryptographers in Minneapolis/St. Paul moving into digital computing, and Eckert & Mauchly Computer Co. in Philadelphia. Both were acquired by Remington Rand in the early 50s.
Here is ERA's 2nd computer, from the mid 50s: the 1103 Scientific, with two women programmers/operators.
#Histodons #history #gender #computing #histodon #compsci #programming #sociology
A quick introduction:
I am a historian of international organizations and I am interested in the impact of digital technologies on our research practices.
A specialist in network analysis applied to history, I also teach data visualization, both historical and non-historical.
Swiss, traveler, cat lover, digital humanist, I loved blogging and micro-blogging, so I'm looking forward to Mastodon.
Are there already any #universities or #HigherEducation institutions running their own Mastodon instances? Thinking about whether it makes sense for my employer to host a server for the various schools and institutes. I am a big fan of #SelfHosting stuff.
As promised -- I set up the subdomain and created a wireless bridge to the new dev server sitting in my bedroom closet. Here is a script showing how to do a basic ngram search.
I'm keeping elasticsearch wide open for people with ES experience to use and help test. All 550 million Twitter users are being loaded into the index.
You should be able to see the mapping file I put together by just going to twitter-elastic.pushshift.io/twitter_user/_mapping in your browser.
Only GET requests should get through -- otherwise someone could just wipe the cluster. :) This is in a dev environment so even if someone did, it isn't the end of the world.
Good luck and have fun!
#!/usr/bin/env python3
import ujson as json
import requests
import logging
from typing import Optional

logging.basicConfig(level=logging.INFO)


def ngram_search(q: str, size: int = 10) -> Optional[dict]:
    '''Function to return Twitter user accounts matching a partial screen_name string'''

    # Initialize vars
    headers, data = dict(), dict()
    domain = "twitter-elastic.pushshift.io"  # This will change once I get DNS set up -- something like twitter_users.pushshift.io
    elastic_url = f"http://{domain}/twitter_user/_search"

    # Set up required headers (Elasticsearch requires content-type)
    headers['content-type'] = "application/json"

    # Prepare search query string (as a data field for a GET request)
    data['query'] = {}
    data['query']['bool'] = {}
    data['query']['bool']['must'] = m = []

    # First match condition (can easily add more since we are using bool -> must query type)
    match = {}
    match['match'] = {}
    match['match']['screen_name.ngram'] = q
    m.append(match)

    # Set up misc search attributes (sort, size, etc.)
    data['sort'] = {'followers_count': 'desc'}
    data['size'] = size

    # Make request and return response if successful
    r = requests.get(elastic_url, headers=headers, data=json.dumps(data))
    if r.ok:
        return r.json()

    # Log the error body and signal failure to the caller
    logging.error(r.content)
    return None


# Perform example search to show Twitter accounts containing "NASA" and sorted by follower count
data = ngram_search(q="nasa", size=25)  # Case does not matter
if data is not None:  # The search returns None when the request fails
    for hit in data['hits']['hits']:
        source = hit['_source']
        print(source['screen_name'], source['followers_count'])
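The script's comment notes that more conditions can be added to the bool -> must list. Here is a minimal sketch of that idea. It only builds the request body and does not contact the server. The helper name build_ngram_query and the min_followers parameter are hypothetical. The range clause is standard Elasticsearch query DSL, but it assumes followers_count is mapped as a numeric field, which the sort in the script suggests but the post does not confirm.

```python
import json


def build_ngram_query(q, size=10, min_followers=None):
    """Build a search body like the one in the script, with an optional extra condition."""
    # First condition: partial match on the screen name (same field as the script)
    must = [{"match": {"screen_name.ngram": q}}]

    # Hypothetical second condition: keep only accounts above a follower threshold
    # (assumes followers_count is a numeric field in the index mapping)
    if min_followers is not None:
        must.append({"range": {"followers_count": {"gte": min_followers}}})

    return {
        "query": {"bool": {"must": must}},
        "sort": {"followers_count": "desc"},
        "size": size,
    }


body = build_ngram_query("nasa", size=25, min_followers=10000)
print(json.dumps(body, indent=2))
```

The resulting dict can be serialized and sent as the data field of the GET request, in the same way the script sends its own query.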
Facebook, Twitter: what if this were the end of social networks? - rts.ch - Sciences-Tech.
I wanted a version of my Twitter archive that didn't display t.co short urls, so ...
I'm using @Wikidata to find people to follow on Mastodon. Presently it knows of 3704 Mastodon accounts (query takes ~10 secs to run). Please add more if you know of notable people joining the network. https://w.wiki/5zi5
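For anyone who wants to reproduce a count like this from a script, here is a rough sketch. It is not the query behind the short link above, whose exact text is not shown here. It assumes the count is based on Wikidata property P4033 (Mastodon address) and uses the public SPARQL endpoint at query.wikidata.org. The function names and the User-Agent string are made up for illustration.

```python
import json
import urllib.parse
import urllib.request

SPARQL_ENDPOINT = "https://query.wikidata.org/sparql"

# Assumption: accounts are counted via P4033 ("Mastodon address")
COUNT_QUERY = """
SELECT (COUNT(DISTINCT ?address) AS ?count) WHERE {
  ?item wdt:P4033 ?address .
}
"""


def parse_count(result):
    """Extract the integer count from a SPARQL JSON result."""
    return int(result["results"]["bindings"][0]["count"]["value"])


def fetch_count(query=COUNT_QUERY):
    """Run the query against the public endpoint (needs network access)."""
    url = SPARQL_ENDPOINT + "?" + urllib.parse.urlencode({"query": query, "format": "json"})
    request = urllib.request.Request(
        url, headers={"User-Agent": "mastodon-count-sketch/0.1 (example)"}
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return parse_count(json.load(response))


# Offline demonstration of the parsing step, using the figure quoted in the post
sample = {"results": {"bindings": [{"count": {"value": "3704"}}]}}
print(parse_count(sample))
```

Calling fetch_count() runs the live query. The number it returns will differ from the one quoted above as accounts are added.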
“This article processing charge is to cover the costs of peer review, copyediting, typesetting, long-term archiving, and journal management.”
OK, it’s good that MDPI explicitly say what the #APCs are (supposedly) needed for. However:
– #peerreview is done for free, only the *management* of the peer review is done by the #publisher.
– Copyediting? What copyediting?
– Typesetting? There are LaTeX and Word templates, the #typesetting is done by the authors, for free.
– Long-term archiving is done by the national library.
So, what remains is “journal management,” i.e., counting the money.
Our Center for Law and Digital Technologies (eLaw) (Leiden University, Netherlands) is hiring a full-time assistant professor of Law and Technology. For more information: https://www.universiteitleiden.nl/en/vacancies/2022/q4/22-73213168assistant-professor-of-elaw-08-of-10-fte
Boost = ❤️ 😄
r/emacs - Top [MONTH]
Mastodon alternative timeline layout
https://www.reddit.com/r/emacs/comments/yziqs8/mastodon_alternative_timeline_layout/
#emacs
Question for the #schweizerbubble: would you like to see the SRG accounts on Mastodon? If so, on which Mastodon server? THANKS for every vote and every comment. 🇨🇭
To make sure there are at least a few votes, a #reblog or #boost is very welcome 🙏
@polymerwitch okay, truly? the fact that it's actually open source software and not proprietary with a guarded api meant for brands doing data gathering means that the tooling around mastodon is so much better
like I'm typing this from an emacs buffer and I *know* it's not going to get me banned for unauthorized use of an API key like a lot of services these days [glares at discord]
Now that our server qoto.org is (temporarily) not listed on the main Mastodon website joinmastodon.org, it seems worthwhile to promote our server directly on the birdsite. So please follow the link below and like or retweet the post, so that it ranks higher in #birdsite search.
@freemo @Gargron
https://twitter.com/eantonicelli/status/1594080051494002688?t=DXyofpLz3gIXczviUa_TNw&s=19
Associate professor of digital humanities, University of Lausanne, Switzerland