With the advent of #ChatGPT, everyone is talking about large language models. But how do they work? Initially, such models were trained to complete sentences.

But they exhibit exciting capabilities that can be invoked by feeding them "prompts."

Read our Prompt Engineering Guide for a quick overview of the current state of this field.

#nlproc #gpt #llm
https://www.inovex.de/de/blog/prompt-engineering-guide/

**Johannes Hoffart** · Dec 26, 2022, 18:24

Johannes Hoffart boosted

**Sebastian Raschka** @SebRaschka@mastodon.social · Dec 26, 2022, 18:24

Dec 26, 2022, 18:24

Sebastian Raschka @SebRaschka@mastodon.social

Scikit-learn 1.2 is out: https://github.com/scikit-learn/scikit-learn/releases/tag/1.2.0

Was an eventful December & I totally missed the new release of my favorite #machinelearning library!

My personal highlights are around the HistGradientBoostingClassifier (if you haven't used it yet, it's a LightGBM impl that works really well)

It now supports

1. interaction constraints (in trees, features that appear along a particular path are considered as "interacting")
2. class weights
3. feature names for categorical features

**Johannes Hoffart** · Dec 23, 2022, 21:14

Johannes Hoffart boosted

**Gerard de Melo** @gdm@mastodon.social · Dec 23, 2022, 21:14

Dec 23, 2022, 21:14

Gerard de Melo @gdm@mastodon.social

😮 Exciting times:

Surprised to see a #ChatGPT style AI model integrated with Web search so soon!

The new #YouChat provides links to sources, but just like other AI models also makes many mistakes.

Will be interesting to see how people use it.

https://you.com/search?q=what+was+the+recent+breakthrough+in+fusion+research%3F

#AI #NLProc #NLP #IR

cf14869ca45ac519.jpeg

**Johannes Hoffart** · Dec 23, 2022, 20:04

Johannes Hoffart boosted

**GanWeaving** @WeavingWithAI@sigmoid.social · Dec 23, 2022, 20:04

Dec 23, 2022, 20:04

GanWeaving @WeavingWithAI@sigmoid.social

I asked #chatGPT for 4 visual descriptions involving technology from the book 'Snowcrash' (so insane that you can now ask for stuff like that!?). I then copy-pasted them into Midjourney. Here are some results.

#midjourney #midjourneyV4 #aiart #aiartist #aiartcommunity

**Johannes Hoffart** · Dec 20, 2022, 12:37

Johannes Hoffart boosted

**Sebastian Raschka** @SebRaschka@mastodon.social · Dec 20, 2022, 12:37

Dec 20, 2022, 12:37

Sebastian Raschka @SebRaschka@mastodon.social

Hey, I am just signed up a few days ago and want to introduce myself.
I am a #machinelearning researcher focusing on deep neural nets. My passion is sharing all kinds of stuff about machine learning & open source. (Some of you may know me from my books “Python Machine Learning” and “Machine Learning with PyTorch and Scikit-Learn”.)
I love to teach others, and am currently working as Lead AI Educator at Lightning AI, and also an Assistant Prof of Statistics at the University of Wisconsin-Madison.

**Johannes Hoffart** · Dec 19, 2022, 21:05

Johannes Hoffart boosted

**Matthew Honnibal** @honnibal@sigmoid.social · Dec 19, 2022, 21:05

Dec 19, 2022, 21:05

Matthew Honnibal @honnibal@sigmoid.social

We've been working on new https://prodi.gy workflows that let you use the OpenAI API to kickstart your annotations, via zero- or few-shot learning. We've just published the first recipe, for NER annotation 🎉 https://github.com/explosion/prodigy-openai-recipes

Here's what, why and how. 🧵

Let's say you want to do some 'traditional' NLP thing, like extracting information from text. The information you want to extract isn't on the public web — it's in this pile of documents you have sitting in front of you.

d5e8f2881ad921ab.mp4

**Johannes Hoffart** · Dec 20, 2022, 19:18

Johannes Hoffart boosted

**Brewster Kahle** @brewsterkahle@mastodon.archive.org · Dec 20, 2022, 19:18

Dec 20, 2022, 19:18

Brewster Kahle @brewsterkahle@mastodon.archive.org

Please donate to the Internet Archive if you can.
https://archive.org/donate

We are a bargain! Serving millions every day with books music video and web archives.

Please help keep everything freely available.

4b5b04bc477a4efd.jpg

**Johannes Hoffart** · Dec 19, 2022, 07:24

Johannes Hoffart boosted

**Olcan** @olcan@mas.to · Dec 19, 2022, 07:24

Dec 19, 2022, 07:24

Olcan @olcan@mas.to

great post about chatgpt, reviewing everything known about how its emergent abilities might have come about https://yaofu.notion.site/How-does-GPT-Obtain-its-Ability-Tracing-Emergent-Abilities-of-Language-Models-to-their-Sources-b9a57ac0fcf74f30a1ab9e3e36fa1dc1

e22e2e28727d1c35.png

**Johannes Hoffart** · Dec 13, 2022, 02:28

Johannes Hoffart boosted

**Ben Lorica 罗瑞卡** @bigdata@indieweb.social · Dec 13, 2022, 02:28

Dec 13, 2022, 02:28

Ben Lorica 罗瑞卡 @bigdata@indieweb.social

Introducing: LAION 5B, a large-scale dataset for research purposes consisting of 5,85B CLIP-filtered image-text pairs. 2,3B contain English language, 2,2B samples from 100+ other languages
#OpenData #MachineLearning
https://laion.ai/blog/laion-5b/

**Johannes Hoffart** · Dec 13, 2022, 06:14

Johannes Hoffart boosted

**Hans-Peter Zorn** @data_hpz@sigmoid.social · Dec 13, 2022, 06:14

Dec 13, 2022, 06:14

Hans-Peter Zorn @data_hpz@sigmoid.social

This article sheds light on the question of why machine learning products mostly do not get into production even though they are enjoying an ongoing boom. Additionally, it shows how MLOps can help to tackle these challenges in the machine learning life cycle.

https://www.inovex.de/de/blog/a-conceptual-view-on-the-machine-learning-life-cycle/ #ml #mlops

**Johannes Hoffart** · Dec 04, 2022, 16:40

Johannes Hoffart boosted

**Gerard de Melo** @gdm@mastodon.social · Dec 04, 2022, 16:40

Dec 04, 2022, 16:40

Gerard de Melo @gdm@mastodon.social

Controversial #machinelearning suggestions by Yann LeCun at #NeurIPS2022 Self-Supervised Learning workshop!

He suggests:

(1) abandoning generative AI architectures
(in favour of joint embedding ones)

(2) abandoning probabilistic models
(in favour of energy-based models)

(3) abandoning contrastive methods
(in favour of regularized methods)

(3) abandoning RL where possible
(in favour of model-predictive control)

Related talk:
http://youtu.be/VRzvpV9DZ8Y

Source: https://twitter.com/BeingMIAkashs/status/1599061227665514496

ab5995f9bde90d85.jpg

**Johannes Hoffart** · Dec 05, 2022, 19:35

Johannes Hoffart boosted

**Ben Lorica 罗瑞卡** @bigdata@indieweb.social · Dec 05, 2022, 19:35

Dec 05, 2022, 19:35

Ben Lorica 罗瑞卡 @bigdata@indieweb.social

Monolith (from ByteDance, creator of TikTok) is an interesting system for online training that addresses two problems faced by modern recommenders: (1) Concept Drift - underlying distribution of the training data is non-stationary; ( 2) Features used by models are mostly sparse, categorical and dynamically changing. #recsys #MachineLearning

https://arxiv.org/abs/2209.07663v2

**Johannes Hoffart** · Dec 05, 2022, 20:20

Johannes Hoffart boosted

**Harald Sack** @lysander07@sigmoid.social · Dec 05, 2022, 20:20

Dec 05, 2022, 20:20

Harald Sack @lysander07@sigmoid.social

Like @timfinin I tried ChatGPT on last semester's final exam for my lecture "Information Service Engineering", with questions/tasks on Knowledge Graphs, basic NLP, and basic ML. It performed surprisingly well (for SPARQL it achieved 11 out of 12 points). Even for more complex questions like performing an evaluation or constructing an FSA, it performed not flawlessly, but not so bad. Overall, ChatGPT would have passed. Congratulations!

#ChatGPT #NLP #FIZISE #ML #knowledgegraph #SPARQL

26afe706c62438d2.png

**Johannes Hoffart** · Dec 02, 2022, 16:03

Johannes Hoffart boosted

**Mark Riedl** @Riedl@sigmoid.social · Dec 02, 2022, 16:03

Dec 02, 2022, 16:03

Mark Riedl @Riedl@sigmoid.social

Here is a twitter thread on ways discovered to "jailbreak" #ChatGPT

1. Pretend to be evil
2. Remind it that it isn't supposed to disagree
3. Wrap it in code
4. Tell GPT to be in opposite mode
5. Convince GPT it is playing an earthlike game
6. Convince it to give examples of what LLMs shouldn't do
https://twitter.com/zswitten/status/1598380220943593472

Show thread

**Johannes Hoffart** · Nov 30, 2022, 21:47

Johannes Hoffart boosted

**Pascal Hitzler** @pascalhitzler@mstdn.social · Nov 30, 2022, 21:47

Nov 30, 2022, 21:47

Pascal Hitzler @pascalhitzler@mstdn.social

We're starting a community slack for anybody interested in Neurosymbolic AI. (drivers include organizers of the annual workshop on the topic, and EiCs and EB members of the journal Neurosymbolic Artificial Intelligence that we're currently starting.
If you'd like to be on the slack, let me know (or anybody else who's already on it). You'll get an invite to your email then.

**Johannes Hoffart** · Nov 29, 2022, 19:44

Johannes Hoffart boosted

**Daniel Hernandez** @danielhz@mastodon.social · Nov 29, 2022, 19:44

Nov 29, 2022, 19:44

Daniel Hernandez @danielhz@mastodon.social

#Wikidata random #SPARQL query: Universities ordered by number of #Mastodon IDs of people who studied there. https://w.wiki/63XG

**Johannes Hoffart** · Nov 22, 2022, 22:10

Johannes Hoffart boosted

**Scott Hanselman 👸🏽🐝🌮** @shanselman@hachyderm.io · Nov 22, 2022, 22:10

Nov 22, 2022, 22:10

Scott Hanselman 👸🏽🐝🌮 @shanselman@hachyderm.io

This is MASSIVE. The Windows Subsystem for Linux in the Microsoft Store is now generally available on Windows 10 and 11! Windows 10 users can now run Linux GUI apps natively! https://devblogs.microsoft.com/commandline/the-windows-subsystem-for-linux-in-the-microsoft-store-is-now-generally-available-on-windows-10-and-11/ #wsl #windows #linux

**Johannes Hoffart** · Nov 28, 2022, 18:20

Johannes Hoffart boosted

**Mathieu Triclot** @mtriclot@aleph.land · Nov 28, 2022, 18:20

Nov 28, 2022, 18:20

Mathieu Triclot @mtriclot@aleph.land

The full paper is here with this telling chart: https://cdn.cms-twdigitalassets.com/content/dam/blog-twitter/official/en_us/company/2021/rml/Algorithmic-Amplification-of-Politics-on-Twitter.pdf

The experimental setup is interesting : Twitter deliberately excluded 1% of its 2016 users from the algorithmic timeline. These users serve as a control group to measure the effect of the algorithm.

The results surprised the study's authors, who expected an increase in amplification at the extremes, both on the right and the left. However, only the right is amplified. This is further proof that political representation with a center and extremes does not reflect the structure of the field.

aa6c03a6748bf25d.png

Show thread

Show older

Website: https://www.hoffart.ai

CTO, AI at SAP - #nlproc #kg #ai (views are my own and do not reflect those of my employer)

Joined Sep 2019

Johannes Hoffart @johannes@qoto.org

Resources

Developers

What is Mastodon?

qoto.org

More…