BUT: I don't take feedback. When someone sends a pull requests, I ignore it.
My view: LLMs are general purpose and more capable than SLMs. They'll win, like CPUs won over special-purpose chips. GPUs will optimize for LLMs and as usage grows, cost will fall. #gpu#llm-ops