Also, can it do grounding like CogVLM?
Either way, great job!
The hope is to get more multimodal models out soon. I'd like to see whether we can get Pixtral and Qwen2.5-VL in before long.
Is there any more specific info available about who (llama.cpp or Ollama) removed what, and where? As far as I can see, the server is still part of llama.cpp.
And more generally: Is this the moment when Ollama and Llama part ways?
How does this address the security concern of filenames being detected and read when not wanted?