Rick(Rugui) CheninGoogle Cloud - CommunityDistributed OpenSource LLM Fine-Tuning with LLaMA-Factory on GKETLDR:Jul 8Jul 8
Rick(Rugui) CheninGoogle Cloud - CommunityStreamline AI ML Model Development on GKE with Skypilot and Vertex AI WorkbenchIntroductionMay 28May 28
Rick(Rugui) CheninGoogle Cloud - CommunityServe Sentence Transformer Embedding Models in GKE AutopilotEmbeddings: The Key to Unlocking LLM & RAG PotentialMay 18May 18
Rick(Rugui) CheninGoogle Cloud - CommunityHigh-performance Stable Diffusion XL Inference on GKE and TPU v5e with MaxDiffusionIntroductionApr 28Apr 28
Rick(Rugui) CheninGoogle Cloud - CommunityLLM&FinOps: Cost Optimization Options to Run High Performance AI/ML Workloads on GKE in Google…1. Introduction & The Cost ChallengeMar 24Mar 24
Rick(Rugui) CheninGoogle Cloud - CommunityUse Google Managed Prometheus and Triton Inference Server on GKE to Simplify LLM observability and…BackgroundMar 13Mar 13
Rick(Rugui) CheninGoogle Cloud - CommunityExplore Duet AI to assist with React frontend appldevelopmentIntroductionFeb 23Feb 23
Rick(Rugui) CheninGoogle Cloud - CommunityServing Open Source LLMs on GKE using vLLM frameworkIntroductionFeb 123Feb 123