Technical Writer
Technical Writer
Transitioning from using ML models via closed source APIs to open source ML models? This checklist provides all necessary resources for the shift.
Learn if LLM inference is compute or memory bound to fully utilize GPU power. Get insights on better GPU resource utilization.
Pin versions of open source packages like PyPi's transformers to avoid breaking changes or security issues; similarly, pin model revisions for stability.
Text embedding models convert text into semantic vectors. Numerous open source models cater to search, recommendation, classification & LLM-augmented retrieval.
Jina AI released jina-embeddings-v2-base-en, a text embedding model that matches OpenAIβs ada-002 model in both benchmark performance and context window length.
This article compares two popular GPUsβthe NVIDIA A10 and A100βfor model inference and discusses the option of using multi-GPU instances for larger models.
SDXL 1.0 initially takes 8-10 seconds for a 1024x1024px image on A100 GPU. Learn how to reduce this to just 1.92 seconds on the same hardware.
Llama 2 rivals GPT-3.5 in quality and powers ChatGPT. Chainlit helps build ChatGPT-like interfaces. This guide shows creating such interfaces with Llama 2.