#markdown
Section collapsed - expand with expand_section("{identifier}").
#llm-ops
#automation
#best-practices
#llm-ops
#optimization
#prompt-engineering
#future
#llm-ops
us.meta.llama3-2-11b-instruct-v1:0
if the model is in a US region. #llm-ops
SentenceTransformer.encode(docs)
it's best if we embed with smaller docs
and call it multiple times (rather than embedding more at once). On Colab T4, for gte-base-en-v1.5
, when embedding 1,000 docs of up to 8K chars each, here is the TOTAL time it took, based on batch sizes (lower is better) #embeddings
#embeddings
#gpu
**/*.md
can search for all Markdown files. Julia Evans #markdown
#cloud
#future