JoeM @jmacc@qoto.org

66 Posts

2 Following

3 Followers

Joined May 2023

2 Following 3 Followers

Posts Posts and replies Media

Show newer

Jul 03, 2023, 08:51

JoeM @jmacc@qoto.org

Idea: train a smaller LLM / classifier which takes input text and produces YES/NO/MAYBE/FAIL answers by generating training data using ChatGPT or another fluent LLM

You can potentially generate training data given a set of input questions; you append the subprompt `(Only give Yes, No, Maybe, or Fail answers. An answer that isn't Yes, No, or Maybe should be Fail)` to each question and feed them into ChatGPT. Its responses (if they match Yes, No, or Maybe; and anything else is implicitly Fail) are the unit-vector outputs to train the new classifer

You could also potentially produce training data by taking random snippets S of text from some large dataset of arbitrary text, and ask ChatGPT: `Given the text "S", please list N questions related to the above text that can be answered with Yes, No, or Maybe, and at the end of each question write their answer (one of: Yes, No, or Maybe)`. Where `N` is some small integer (maybe `5 <= N <= 100`)

This classifier could potentially be used to update a system that is keeping track of how some human-programmable state is evolving when the evolved state is not human-programmable but human-describable: you evolve the system and describe it in text, then ask a finite set of questions to synchronize the programmable state with the new system state description

For example, anyone who played the old AI Dungeon back when it used GPT-2 (and probably still now), or who has played a text adventure using ChatGPT (which is really fun: try it out!), knows that the finite length of the input for those systems means they lose track of information frequently, and there are a lot of small details that are lost in general. A human-programmable text adventure, on the other hand, has limited generality, but has a definitive state. With the above classifier you could potentially make a program with a definitive, human-programmable state, evolve the state using a LLM, then update the human-programmable state with the new state's text-description using the classifier

This same technique might be useful for LLMs themselves to generate notes to augment their memories

**JoeM** @jmacc@qoto.org · Jun 28, 2023, 12:30

**JoeM** @jmacc@qoto.org · Jun 28, 2023, 12:30

Jun 28, 2023, 12:30

JoeM @jmacc@qoto.org

Another stable diffusion controlnet idea:
A module similar to the reference preprocessor but with a text prompt. The prompt controls what the model's attention goes to in the reference image. Presumably this would allow you to reference just one feature of the reference image, and essentially ignore everything else

**JoeM** @jmacc@qoto.org · Jun 22, 2023, 22:09

**JoeM** @jmacc@qoto.org · Jun 22, 2023, 22:09

Jun 22, 2023, 22:09

JoeM @jmacc@qoto.org

Averaging different seeds at the same denoising strength in img2img shows the scale that denoising strength affects

As seen in this video / image: https://imgur.com/a/pz2BCgS

**JoeM** @jmacc@qoto.org · Jun 09, 2023, 14:53

**JoeM** @jmacc@qoto.org · Jun 09, 2023, 14:53

Jun 09, 2023, 14:53

JoeM @jmacc@qoto.org

Idea for stable diffusion: train a model to correct an image which has been randomly deformed. It may be cheap enough to use perlin noise, or similar, to generate random deformations, but otherwise something like GIMP's pick noise, which just randomly exchanges pixels with nearby pixels n times, may be faster

Theoretically, you could use a regular image as the initial noisy image, and the model would then deform it to match what it thinks is the denoised equivalent. This might allow for, eg: correction of anatomical problems for characters, composition problems, etc

**JoeM** @jmacc@qoto.org · Jun 09, 2023, 10:36

**JoeM** @jmacc@qoto.org · Jun 09, 2023, 10:36

Jun 09, 2023, 10:36

JoeM @jmacc@qoto.org

I've been using [obsidian](https://obsidian.md/)'s [canvas](https://obsidian.md/canvas?trk=public_post-text) feature as a sort-of multi-dimensional [kanban board](https://en.wikipedia.org/wiki/Kanban_board) plus calendar and goal timeline and it works amazingly well. It seems like its really important to have the right representation for these sorts of things, and this works really well as a representation for me. I highly recommend checking it out

Joined May 2023

JoeM @jmacc@qoto.org

ai Jul 14, 2023, 11:06

programming Aug 23, 2023, 12:27

machinelearning Jul 14, 2023, 11:06

development Aug 23, 2023, 12:27

ml Jul 14, 2023, 11:06

Trending now

Resources

Developers

What is Mastodon?

qoto.org

More…