**Johannes Hoffart** · Jan 20, 2023, 09:50

Johannes Hoffart

Johannes Hoffart @johannes@qoto.org

79 Toots

135 Following

79 Followers

Website: https://www.hoffart.ai

CTO, AI at SAP - #nlproc #kg #ai (views are my own and do not reflect those of my employer)

Joined Sep 2019

135 Following 79 Followers

Toots Toots and replies Media

Johannes Hoffart boosted

**Sören Auer 🇪🇺🇺🇦** @soeren_auer@mstdn.social · Jan 20, 2023, 09:50

Jan 20, 2023, 09:50

Sören Auer 🇪🇺🇺🇦 @soeren_auer@mstdn.social

Recent events have demonstrated how crucial resilience (e.g. of supply chains) is for our society. Semantic technologies can play a crucial here.

We will organize the #D2R2 (Linked Data-driven Resilience Research) #workshop at @eswc_conf@twitter.com
in May 2023 in Crete. We are looking forward to your contribution. Submission deadline is March 9. Check more details on our event page! https://d2r2.aksw.org #ESWC23 #CoyPu_Project #Resilience #LinkedData #cfp

**Johannes Hoffart** · Jan 16, 2023, 12:57

Johannes Hoffart boosted

**Max-Planck-Gesellschaft** @maxplanckgesellschaft@wisskomm.social · Jan 16, 2023, 12:57

Jan 16, 2023, 12:57

Max-Planck-Gesellschaft @maxplanckgesellschaft@wisskomm.social

The artificial-intelligence chatbot ChatGPT can write fake abstracts that scientists have trouble distinguishing from those written by humans. Increasing sophistication of chatbots could undermine research integrity and accuracy, researchers fear.

https://www.nature.com/articles/d41586-023-00056-7

#chatbots #ChatGPT #research #AI #ArtificialIntelligence

via @Nature

**Johannes Hoffart** · Jan 16, 2023, 03:19

Johannes Hoffart boosted

**naaclmeeting** @naaclmeeting@sigmoid.social · Jan 16, 2023, 03:19

Jan 16, 2023, 03:19

naaclmeeting @naaclmeeting@sigmoid.social

Hello NLP researchers around the globe! All ACL major conferences (@aclmeeting, @eaclmeeting, @aaclmeeting, and @emnlpmeeting) now have an account here. Please spread it word! #NLPRoc

**Johannes Hoffart** · Jan 06, 2023, 00:25

Johannes Hoffart boosted

**Kristin Branson** @kristinmbranson@social.coop · Jan 06, 2023, 00:25

Jan 06, 2023, 00:25

Kristin Branson @kristinmbranson@social.coop

I found the papers "Scaling Laws for Neural Language Models" (OpenAI, 2020) and "Training Compute-Optimal Large Language Models" (DeepMind, 2022) interesting:
https://arxiv.org/pdf/2001.08361.pdf
https://arxiv.org/pdf/2203.15556.pdf
They do a LOT of experiments training large language models (causal transformers) with varying hyperparameters, in particular model size, shape, batch size, and training data set size over many orders of magnitude. 1/?

**Johannes Hoffart** · Jan 06, 2023, 00:26

Johannes Hoffart boosted

**Kristin Branson** @kristinmbranson@social.coop · Jan 06, 2023, 00:26

Jan 06, 2023, 00:26

Kristin Branson @kristinmbranson@social.coop

DeepMind's paper refutes this last claim, and finds that both are equally useful.
The differences between DeepMind & OpenAI's papers matter in terms of forecasting how big LLMs need to get. They arrived at these different conclusions because DeepMind did more learning rate tuning. This blog post https://severelytheoretical.wordpress.com/2022/07/18/thoughts-on-the-new-scaling-laws-for-large-language-models/ hypothesizes that DeepMind's paper might also be not doing enough hyperparameter tuning, and the scaling law may be less severe, perhaps not even a power law.
3/3

Show thread

**Johannes Hoffart** · Jan 06, 2023, 07:50

Johannes Hoffart boosted

**Ben Lorica 罗瑞卡** @bigdata@indieweb.social · Jan 06, 2023, 07:50

Jan 06, 2023, 07:50

Ben Lorica 罗瑞卡 @bigdata@indieweb.social

On #TheDataExchangePod I speak with Mark Chen, Research Scientist at OpenAI. We discuss the evolution of DALL·E, key research developments that led to DALL·E 2, data sources, safety measures, ML models needed for its success. #machinelearning #dalle2 #dalle #AI #generativeai https://thedataexchange.media/exploring-dalle-2/

**Johannes Hoffart** · Jan 06, 2023, 09:18

Johannes Hoffart boosted

**Pieter Colpaert** @pietercolpaert@mastodon.social · Jan 06, 2023, 09:18

Jan 06, 2023, 09:18

Pieter Colpaert @pietercolpaert@mastodon.social

I do however have high hopes for #blogic and RDF+Surfaces to make the interpretation of RDF vocabularies interoperable across organizations

https://w3c-cg.github.io/rdfsurfaces/

Show thread

**Johannes Hoffart** · Jan 03, 2023, 05:05

Johannes Hoffart boosted

**Jacob Eisenstein** @jacobeisenstein@mastodon.social · Jan 03, 2023, 05:05

Jan 03, 2023, 05:05

Jacob Eisenstein @jacobeisenstein@mastodon.social

Very interesting essay on LLMs, their limitations, and their future by @yoavgo!

https://gist.github.com/yoavg/59d174608e92e845c8994ac2e234c8a9

**Johannes Hoffart** · Jan 02, 2023, 15:22

Johannes Hoffart boosted

**Sebastian Raschka** @SebRaschka@mastodon.social · Jan 02, 2023, 15:22

Jan 02, 2023, 15:22

Sebastian Raschka @SebRaschka@mastodon.social

The latest issue of 'Ahead of AI' is now available!

This edition covers my top 10 papers of the year, as well as trends in the AI industry, notable developments in open source projects, and my personal yearly review routine.

Check it out at the link below and have a happy new year!

https://magazine.sebastianraschka.com/p/ahead-of-ai-4-a-big-year-for-ai

**Johannes Hoffart** · Dec 29, 2022, 21:01

Johannes Hoffart boosted

**Lucas Beyer** @lb@sigmoid.social · Dec 29, 2022, 21:01

Dec 29, 2022, 21:01

Lucas Beyer @lb@sigmoid.social

How good of a BERT can one get in ONE DAY on ONE GPU?

With all the recent studies about scaling compute up, this paper takes a refreshing turn and does a deep dive into scaling down compute.

It's well written, stock full of insights. Here is my summary and my opinions.

https://arxiv.org/abs/2212.14034

🧶 1/N

7ca622cee9947b3a.png

**Johannes Hoffart** · Dec 30, 2022, 08:42

Johannes Hoffart boosted

**fsasaki** @fsasaki@qoto.org · Dec 30, 2022, 08:42

Dec 30, 2022, 08:42

fsasaki @fsasaki@qoto.org

#xml has #xproc for building pipelines. What is the counterpart for #rdf? Any pointers & ideas would be very welcome.

**Johannes Hoffart** · Dec 30, 2022, 09:23

Johannes Hoffart boosted

**Hans-Peter Zorn** @data_hpz@sigmoid.social · Dec 30, 2022, 09:23

Dec 30, 2022, 09:23

Hans-Peter Zorn @data_hpz@sigmoid.social

With the advent of #ChatGPT, everyone is talking about large language models. But how do they work? Initially, such models were trained to complete sentences.

But they exhibit exciting capabilities that can be invoked by feeding them "prompts."

Read our Prompt Engineering Guide for a quick overview of the current state of this field.

#nlproc #gpt #llm
https://www.inovex.de/de/blog/prompt-engineering-guide/

**Johannes Hoffart** · Dec 26, 2022, 18:24

Johannes Hoffart boosted

**Sebastian Raschka** @SebRaschka@mastodon.social · Dec 26, 2022, 18:24

Dec 26, 2022, 18:24

Sebastian Raschka @SebRaschka@mastodon.social

Scikit-learn 1.2 is out: https://github.com/scikit-learn/scikit-learn/releases/tag/1.2.0

Was an eventful December & I totally missed the new release of my favorite #machinelearning library!

My personal highlights are around the HistGradientBoostingClassifier (if you haven't used it yet, it's a LightGBM impl that works really well)

It now supports

1. interaction constraints (in trees, features that appear along a particular path are considered as "interacting")
2. class weights
3. feature names for categorical features

**Johannes Hoffart** · Dec 23, 2022, 21:14

Johannes Hoffart boosted

**Gerard de Melo** @gdm@mastodon.social · Dec 23, 2022, 21:14

Dec 23, 2022, 21:14

Gerard de Melo @gdm@mastodon.social

😮 Exciting times:

Surprised to see a #ChatGPT style AI model integrated with Web search so soon!

The new #YouChat provides links to sources, but just like other AI models also makes many mistakes.

Will be interesting to see how people use it.

https://you.com/search?q=what+was+the+recent+breakthrough+in+fusion+research%3F

#AI #NLProc #NLP #IR

cf14869ca45ac519.jpeg

**Johannes Hoffart** · Dec 23, 2022, 20:04

Johannes Hoffart boosted

**GanWeaving** @WeavingWithAI@sigmoid.social · Dec 23, 2022, 20:04

Dec 23, 2022, 20:04

GanWeaving @WeavingWithAI@sigmoid.social

I asked #chatGPT for 4 visual descriptions involving technology from the book 'Snowcrash' (so insane that you can now ask for stuff like that!?). I then copy-pasted them into Midjourney. Here are some results.

#midjourney #midjourneyV4 #aiart #aiartist #aiartcommunity

**Johannes Hoffart** · Dec 20, 2022, 12:37

Johannes Hoffart boosted

**Sebastian Raschka** @SebRaschka@mastodon.social · Dec 20, 2022, 12:37

Dec 20, 2022, 12:37

Sebastian Raschka @SebRaschka@mastodon.social

Hey, I am just signed up a few days ago and want to introduce myself.
I am a #machinelearning researcher focusing on deep neural nets. My passion is sharing all kinds of stuff about machine learning & open source. (Some of you may know me from my books “Python Machine Learning” and “Machine Learning with PyTorch and Scikit-Learn”.)
I love to teach others, and am currently working as Lead AI Educator at Lightning AI, and also an Assistant Prof of Statistics at the University of Wisconsin-Madison.

**Johannes Hoffart** · Dec 19, 2022, 21:05

Johannes Hoffart boosted

**Matthew Honnibal** @honnibal@sigmoid.social · Dec 19, 2022, 21:05

Dec 19, 2022, 21:05

Matthew Honnibal @honnibal@sigmoid.social

We've been working on new https://prodi.gy workflows that let you use the OpenAI API to kickstart your annotations, via zero- or few-shot learning. We've just published the first recipe, for NER annotation 🎉 https://github.com/explosion/prodigy-openai-recipes

Here's what, why and how. 🧵

Let's say you want to do some 'traditional' NLP thing, like extracting information from text. The information you want to extract isn't on the public web — it's in this pile of documents you have sitting in front of you.

d5e8f2881ad921ab.mp4

**Johannes Hoffart** · Dec 20, 2022, 19:18

Johannes Hoffart boosted

**Brewster Kahle** @brewsterkahle@mastodon.archive.org · Dec 20, 2022, 19:18

Dec 20, 2022, 19:18

Brewster Kahle @brewsterkahle@mastodon.archive.org

Please donate to the Internet Archive if you can.
https://archive.org/donate

We are a bargain! Serving millions every day with books music video and web archives.

Please help keep everything freely available.

4b5b04bc477a4efd.jpg

**Johannes Hoffart** · Dec 19, 2022, 07:24

Johannes Hoffart boosted

**Olcan** @olcan@mas.to · Dec 19, 2022, 07:24

Dec 19, 2022, 07:24

Olcan @olcan@mas.to

great post about chatgpt, reviewing everything known about how its emergent abilities might have come about https://yaofu.notion.site/How-does-GPT-Obtain-its-Ability-Tracing-Emergent-Abilities-of-Language-Models-to-their-Sources-b9a57ac0fcf74f30a1ab9e3e36fa1dc1

e22e2e28727d1c35.png

**Johannes Hoffart** · Dec 13, 2022, 02:28

Johannes Hoffart boosted

**Ben Lorica 罗瑞卡** @bigdata@indieweb.social · Dec 13, 2022, 02:28

Dec 13, 2022, 02:28

Ben Lorica 罗瑞卡 @bigdata@indieweb.social

Introducing: LAION 5B, a large-scale dataset for research purposes consisting of 5,85B CLIP-filtered image-text pairs. 2,3B contain English language, 2,2B samples from 100+ other languages
#OpenData #MachineLearning
https://laion.ai/blog/laion-5b/

Website: https://www.hoffart.ai

CTO, AI at SAP - #nlproc #kg #ai (views are my own and do not reflect those of my employer)

Joined Sep 2019

Johannes Hoffart @johannes@qoto.org

Trending now

Resources

Developers

What is Mastodon?

qoto.org

More…