Definitely going to wait for the patch for LLaMa 2 70B model, even with 4bit quantization, I still want to see what this 70B model can do.
The 13B is really amazed me. It's 10% parameters of ChatGPT, but it's free with (almost) no restrictions (please do not use it for ilegal reasons), and can run on my own laptop.