Changes for adding gemma model config by vavarshn · Pull Request #90 · splunk/splunk-ai-operator

vavarshn · 2026-05-13T11:04:46Z

Summary

This PR updates the AI Tier SAIA model deployment config to use Gemma 4 as the primary hosted SAIA v2 model while keeping GPT-OSS 20B available for supporting flows.
Changes include:

Replace the GptOss120b Ray Serve application with Gemma431bIt (gemma4_31b_it).
Keep GptOss20b deployed for the field-description, conversation-title, and metadata-description paths used by saia-service.
Update SAIA feature replica defaults to scale Gemma431bIt and GptOss20b.
Update the Ray builder config test to expect Gemma431bIt + GptOss20b as the text generation apps.
Update k0s quickstart model artifact documentation from gpt-oss-120b to gemma-4-31b-it.
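
As a rough illustration of the change list above, the sketch below models the set of text generation apps the updated builder test is described as expecting. The app names `Gemma431bIt`, `GptOss20b`, and `gemma4_31b_it` come from this description; the `RayApp` dataclass, its fields, the `gpt_oss_20b` identifier, and the replica counts are hypothetical placeholders, not the operator's actual types or defaults.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RayApp:
    # Hypothetical shape; the operator's real config type may differ.
    name: str      # Ray Serve application name
    model_id: str  # model identifier ("gpt_oss_20b" is an assumed value)
    replicas: int  # placeholder, not the real replica default


def text_generation_apps() -> list[RayApp]:
    """Apps expected after this change: Gemma 4 as primary, GPT-OSS 20B as auxiliary."""
    return [
        RayApp(name="Gemma431bIt", model_id="gemma4_31b_it", replicas=1),
        RayApp(name="GptOss20b", model_id="gpt_oss_20b", replicas=1),
    ]


def app_names(apps: list[RayApp]) -> list[str]:
    """Return only the application names, in declaration order."""
    return [app.name for app in apps]
```

A test in this style would assert that the names are exactly `Gemma431bIt` and `GptOss20b`, and that `GptOss120b` no longer appears.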

Why

SAIA v2 hosted model selection now defaults to Gemma 4 when use_gpt_oss_120b is false or missing. This operator change makes the corresponding Gemma 4 Ray endpoint available in AI Tier and removes the GPT-OSS 120B app from the local deployment config.
GPT-OSS 20B is intentionally retained because saia-service still uses it for auxiliary CMP flows such as field descriptions, title generation, and metadata descriptions.
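
The selection behavior described above can be sketched as follows. This is a minimal illustration and not the saia-service implementation: the `select_app` function and the flow names are hypothetical, and only the `use_gpt_oss_120b` flag and the app names are taken from this description.

```python
from typing import Any, Mapping

# Hypothetical flow identifiers for the auxiliary paths that stay on GPT-OSS 20B.
AUXILIARY_FLOWS = frozenset(
    {"field_description", "conversation_title", "metadata_description"}
)


def select_app(flow: str, settings: Mapping[str, Any]) -> str:
    """Pick the Ray Serve app name for a request (illustrative only)."""
    if flow in AUXILIARY_FLOWS:
        # Auxiliary flows keep using the smaller model.
        return "GptOss20b"
    if settings.get("use_gpt_oss_120b", False):
        # Explicit opt-in; note this app is removed from the local
        # deployment config by this change.
        return "GptOss120b"
    # Flag false or missing: default to Gemma 4.
    return "Gemma431bIt"
```

For example, `select_app("chat", {})` returns `"Gemma431bIt"`, while `select_app("conversation_title", {})` returns `"GptOss20b"`.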


kupratyu-splunk approved these changes May 13, 2026


vvarshney-splunk added 2 commits May 21, 2026 11:34

Changes for adding gemma model config

a099797

feat: configure Gemma 4 31B for 2-GPU L40S and add LLM defaults to SA…

7c0722d

…IA operator

vavarshn force-pushed the vvarshney/gemma4_ai_tier branch from 7952ecc to 7c0722d Compare May 21, 2026 06:05

vavarshn wants to merge 2 commits into main from vvarshney/gemma4_ai_tier


3 participants
