llama.cpp Quickstart with CLI and Server
#Cheatsheet #GGUF #AI #LLM #DevOps #OpenAI #API #SelfHosting #CUDA #Prometheus #llama.cpp
https://www.glukhov.org/llm-hosting/llama-cpp/
llama.cpp Quickstart with CLI and Server
#Cheatsheet #GGUF #AI #LLM #DevOps #OpenAI #API #SelfHosting #CUDA #Prometheus #llama.cpp
https://www.glukhov.org/llm-hosting/llama-cpp/
Rust vs Python for AI Development: A Comprehensive Comparison
#Rust #Python #AI development #machine learning #performance comparison
https://dasroot.net/posts/2026/02/rust-vs-python-ai-development-comparison/
OpenCode Quickstart: Install, Configure, and Use the Terminal AI Coding Agent
#Cheatsheet #ai-devtools #coding-agents #terminal #developer-tools #llm-tools #LLM #AI #AI Coding #Dev #DevOps
https://www.glukhov.org/ai-devtools/opencode/
Rust and WebAssembly for AI Interfaces: A 2026 Perspective
#Rust #WebAssembly #AI Interfaces #Monty #wasm-pack
https://dasroot.net/posts/2026/02/rust-webassembly-ai-interfaces-2026/
Airtable for Developers & DevOps - Plans, API, Webhooks, and Go/Python Examples
#Cloud #Hosting #Dev #DevOps #Go #Golang #Python #Integration #AI #API
https://www.glukhov.org/data-infrastructure/integrations/airtable-for-developers-and-devops/
Comparing LLMs performance on Ollama on 16GB VRAM GPU
#LLM #Ollama #NVidia #Hardware #Self-Hosting #Open Source #DeepLearning #AI
https://www.glukhov.org/llm-performance/benchmarks/choosing-best-llm-for-ollama-on-16gb-vram-gpu/
LLM Performance and PCIe Lanes: Key Considerations
#Self-Hosting #LLM #Performance #AI #Ollama #Hardware #DeepLearning
https://www.glukhov.org/llm-performance/hardware/llm-performance-and-pci-lanes/
Terminal Multiplexers: tmux vs Zellij β A Comprehensive Comparison
#tmux #Zellij #terminal multiplexer #DevOps tools #command line interface
https://dasroot.net/posts/2026/02/terminal-multiplexers-tmux-vs-zellij-comparison/
Search vs Deepsearch vs Deep Research
#Cloud #LLM #AI #Perplexica
https://www.glukhov.org/rag/architecture/search-vs-deepsearch-vs-deep-research/
Markdown Code Blocks: Complete Guide with Syntax, Languages & Examples
#Hugo #Cheatsheet #Markdown
https://www.glukhov.org/documentation-tools/markdown/markdown-codeblocks/
Markdown Cheatsheet: Syntax, Formatting & Structure Quick Reference
#Hugo #Cheatsheet #Markdown
https://www.glukhov.org/documentation-tools/markdown/markdown-cheatsheet/
Monitor LLM Inference in Production (2026): Prometheus & Grafana for vLLM, TGI, llama.cpp
#Monitoring #Hosting #Self-Hosting #LLM #AI #DevOps #Docker #K8S #Prometheus #Grafana #observability #kubernetes #vllm
https://www.glukhov.org/observability/monitoring-llm-inference-prometheus-grafana/
Docker Model Runner vs Ollama (2026): Which Is Better for Local LLMs?
#Docker #Ollama #LLM #AI #DevOps #Self-Hosting #Linux #API #NVidia
https://www.glukhov.org/llm-hosting/comparisons/docker-model-runner-vs-ollama-comparison/
Ollama vs vLLM vs LM Studio: Best Way to Run LLMs Locally in 2026?
#LLM #AI #Ollama #vllm #Privacy #Open Source #Self-Hosting #Docker #API #Machine Learning #RAG
https://www.glukhov.org/llm-hosting/comparisons/hosting-llms-ollama-localai-jan-lmstudio-vllm-comparison/
OpenClaw Quickstart: Install with Docker (Ollama GPU or Claude + CPU)
#Hosting #Self-Hosting #LLM #AI #Ollama #Docker #Open Source #RAG #OpenClaw
https://www.glukhov.org/ai-systems/openclaw/quickstart/
Garage vs MinIO vs AWS S3: Object Storage Comparison and Feature Matrix
#Minio #Garage #S3 #AWS #Hosting #Self-Hosting #DevOps #Open Source
https://www.glukhov.org/data-infrastructure/object-storage/garage-vs-minio-vs-s3/
Implementing Workflow Applications with Temporal in Go: A Complete Guide
#Go #Golang #devops #coding #LLM #Architecture #AI Coding #Dev #Open Source
https://www.glukhov.org/post/2026/03/workflow-applications-temporal-in-go/
Garage - S3 compatible object storage Quickstart
#Self-Hosting #s3 #object-storage #self-hosted #backup #observability
https://www.glukhov.org/data-infrastructure/object-storage/garage-quickstart/
Observability for LLM Systems: Metrics, Traces, Logs, and Testing in Production
#LLM #Prometheus #Grafana #Kubernetes #Monitoring #AI #DevOps #Hosting
https://www.glukhov.org/observability/observability-for-llm-systems/
Using Go to Build RAG Systems: WeKnora Deep Dive
#Go #RAG #WeKnora #Agent Skills #Hybrid Retrieval
https://dasroot.net/posts/2026/02/using-go-build-rag-systems-weknora-deep-dive/
Chunking Strategies in RAG Comparison: Alternatives, Tradeβoffs, and Examples
#RAG #Vector Databases #LLM Performance #DevOps #Hardware #Python #LLM #AI #AI Coding #API #Dev #Coding
https://www.glukhov.org/rag/retrieval/chunking-strategies-in-rag/
Ollama CLI Cheatsheet: ls, serve, run, ps + commands (2026 update)
#Linux #Cheatsheet #Self-Hosting #LLM #AI #Ollama #DevOps #Python
https://www.glukhov.org/llm-hosting/ollama/ollama-cheatsheet/
Writing High-Throughput Network Clients in Go
#Go #network clients #high-throughput #gRPC #HTTP/2
https://dasroot.net/posts/2026/02/writing-high-throughput-network-clients-go/
Running LLMs Locally for Data Privacy
#LLM #NVIDIA GPU #Google TPU #PyTorch #Hugging Face Transformers #Model Quantization #Data Privacy #Secure Communication #Access Control #TLS 1.3
https://dasroot.net/posts/2026/02/running-llms-locally-data-privacy/
How to Configure Desktop Launchers on Ubuntu 24 with Standard Icons
#Linux #Cheatsheet #bash #Dev #Howtos
https://www.glukhov.org/post/2026/02/configure-desktop-launchers-ubuntu-24/
Agentic AI and Security: A Deep Technical Analysis in 2026
#Agentic AI #AI Security #OWASP AIVSS #MAESTRO #Observability
https://dasroot.net/posts/2026/02/agentic-ai-security-deep-technical-analysis-2026/
Ansible vs Puppet vs Chef vs SaltStack: Configuration Management Tool Comparison
#Ansible #Puppet #Chef #SaltStack #Configuration Management
https://dasroot.net/posts/2026/02/ansible-vs-puppet-vs-chef-vs-saltstack-configuration-management/
Retrieval-Augmented Generation (RAG) Tutorial: Architecture, Implementation, and Production Guide:
www.glukhov.org/rag/
#AI #LLM #RAG #Embeddings #Reranking #VectorDatabase
Observability: Monitoring, Metrics, Prometheus & Grafana Guide:
www.glukhov.org/observability/
#Monitoring #Observability #Prometheus #Grafana #Kubernetes #DevOps
API-First Development and Contract Testing: Modern Practices and Tools
#API-First Development #Contract Testing #OpenAPI #Pact #Microservices
https://dasroot.net/posts/2026/02/api-first-development-contract-testing/