Update the default Sourcegraph-supplied LLM models #64281
Conversation
Just FYI, there is also some silly business going on in the clients where we leverage percentage-user feature flags to override 'fireworks/starcoder' to 'deepseek V2'... I haven't traced that code path, but it's something we'll need to figure out. I think it only applies to dotcom, but I'm not 100% positive.
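The override mechanism mentioned above lives in the TypeScript clients, but as a rough illustration of how a percentage-user feature flag typically works (hash the user into a stable 0-99 bucket, compare against the rollout percentage), here is a hedged Go sketch. The function names, the 50% figure, and the DeepSeek model string are all hypothetical placeholders, not taken from the actual client code:

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// inRollout is a generic sketch of a percentage-user feature flag:
// hash the user ID into a stable 0-99 bucket and compare against the
// rollout percentage, so a given user is consistently in or out.
func inRollout(userID string, percent uint32) bool {
	h := fnv.New32a()
	h.Write([]byte(userID))
	return h.Sum32()%100 < percent
}

// completionModel shows the kind of override described above: a flag
// swapping the default "fireworks/starcoder" for a DeepSeek V2 model.
// The 50% rollout and the DeepSeek identifier are made up for the example.
func completionModel(userID string) string {
	if inRollout(userID, 50) {
		return "fireworks/deepseek-v2" // placeholder model name
	}
	return "fireworks/starcoder"
}

func main() {
	fmt.Println(completionModel("some-user"))
}
```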
aramaraju
left a comment
A few small nits that caught my eye
```go
newModel(
	modelIdentity{
		MRef: mRef(fireworksV1, "starcoder"),
		// NOTE: THis model name is virutalized.
```
```diff
- // NOTE: THis model name is virutalized.
+ // NOTE: This model name is virtualized.
```
cmd/cody-gateway-config/main.go (Outdated)
```diff
  //
  // See internal/version/version.go for reference.
- Revision: "0.0.0+dev",
+ Revision: "0.0.`0+dev",
```
That's a typo, good catch 😅
@slimsag not sure about the specifics, as there are several places where we could "virtualize" the model name, and/or pick something different via a feature flag. But this is what first comes to mind:
```diff
  {
    "schemaVersion": "1.0",
-   "revision": "0.0.0+dev",
+   "revision": "0.0.`0+dev",
```
@chrsmith here's a thread to pull on: https://sourcegraph.com/github.com/sourcegraph/cody/-/blob/vscode/src/completions/providers/create-provider.ts?L194-240 (again, no idea what is going on there specifically)
```diff
- Chat:           types.ModelRef("anthropic::2023-06-01::claude-3-sonnet"),
- CodeCompletion: types.ModelRef("anthropic::2023-06-01::claude-3-sonnet"),
- FastChat:       types.ModelRef("anthropic::2023-06-01::claude-3-sonnet"),
+ Chat:           types.ModelRef("anthropic::2023-06-01::claude-3.5-sonnet"),
```
I see Claude 3.5 Sonnet called anthropic/claude-3-5-sonnet-20240620, according to Gateway telemetry. Flagging in case it's worth making these names consistent (3-5 vs 3.5)
That's a good concern to call out, but the difference is 100% A-OK in this case.
Here we are using the Model ID claude-3.5-sonnet. This will refer to an object with a Model Name of claude-3-5-sonnet-20240620. One of the benefits of using this whole "model config" system is that we are no longer conflating the two, and can treat them as separate values.
So claude-3.5-sonnet can later refer to claude-3-5-sonnet-20240805 without needing to make as many changes across the codebase.
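To make the ID/name split concrete, here is a small illustrative sketch. The Model struct and parseMRef helper below are hypothetical stand-ins, not the actual types in internal/modelconfig; only the "provider::api-version::model-id" mref shape and the example values come from the discussion above:

```go
package main

import (
	"fmt"
	"strings"
)

// Model pairs a stable, user-facing Model ID (the last mref segment)
// with the provider's concrete model name. Keeping them separate lets
// the ID stay "claude-3.5-sonnet" while the underlying name moves from
// one dated snapshot to a newer one.
type Model struct {
	// MRef has the form "provider::api-version::model-id",
	// e.g. "anthropic::2023-06-01::claude-3.5-sonnet".
	MRef      string
	ModelName string // e.g. "claude-3-5-sonnet-20240620"
}

// parseMRef splits an mref into its three components.
func parseMRef(mref string) (provider, apiVersion, modelID string, err error) {
	parts := strings.Split(mref, "::")
	if len(parts) != 3 {
		return "", "", "", fmt.Errorf("malformed mref %q", mref)
	}
	return parts[0], parts[1], parts[2], nil
}

func main() {
	m := Model{
		MRef:      "anthropic::2023-06-01::claude-3.5-sonnet",
		ModelName: "claude-3-5-sonnet-20240620",
	}
	provider, apiVersion, modelID, err := parseMRef(m.MRef)
	if err != nil {
		panic(err)
	}
	fmt.Println(provider, apiVersion, modelID) // anthropic 2023-06-01 claude-3.5-sonnet
}
```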
chenkc805
left a comment
Re: Cody Free/Pro using a super-shady hack:
When will we address this? Because my understanding of the server-side model selection is that it'll be easier to set up new models, so easy in fact that even a PM can do it.
If these experiences bifurcate like what's being described here, will we have achieved this goal? If not, do we have a follow-up planned?
@chenkc805 I will update the tickets in Linear this week to call out the sequence of steps needed so we can make Cody Pro "just work" with the new model config system. (It's not a lot of work, but we do need to be careful when we roll out the changes to avoid any disruptions.)
@aramaraju PTAL
Looks good to me, Chris! Thank you 🙇
This PR sets the defaults for "Sourcegraph supplied LLM models".
When will these "defaults" be used?
These models will only be used IFF the Sourcegraph instance is explicitly using the newer "modelConfiguration" site configuration data. (And opts into using Sourcegraph-supplied LLM models.)
If the Sourcegraph instance is using the older "completions" configuration blob, then only the user-supplied models will be used (or the specific defaults defined in the code for the completions provider).
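A minimal sketch of that gating, assuming a simplified view of the site configuration (the real schema and field names differ; SiteConfig and modelSource here are hypothetical):

```go
package main

import "fmt"

// SiteConfig is a trimmed-down, hypothetical view of the two competing
// configuration blobs described above; the real site configuration
// schema has many more fields.
type SiteConfig struct {
	ModelConfiguration *ModelConfiguration // newer config
	Completions        *CompletionsConfig  // legacy config
}

type ModelConfiguration struct {
	SourcegraphModels struct{ Enabled bool } // opt-in to Sourcegraph-supplied models
}

type CompletionsConfig struct{}

// modelSource reports which set of models an instance would serve.
func modelSource(c SiteConfig) string {
	switch {
	case c.ModelConfiguration != nil && c.ModelConfiguration.SourcegraphModels.Enabled:
		// Only here do the new Sourcegraph-supplied defaults apply.
		return "sourcegraph-supplied defaults"
	case c.ModelConfiguration != nil:
		return "user-supplied models only"
	default:
		// Legacy "completions" blob: user-supplied models, or the
		// provider-specific defaults hard-coded for that path.
		return "legacy completions defaults"
	}
}

func main() {
	cfg := SiteConfig{ModelConfiguration: &ModelConfiguration{}}
	cfg.ModelConfiguration.SourcegraphModels.Enabled = true
	fmt.Println(modelSource(cfg)) // sourcegraph-supplied defaults
}
```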
What about Cody Free or Cody Pro?
😬 yeah, we're going to need to deal with that later. Currently Sourcegraph.com is not using the newer "modelConfiguration" site configuration, and instead we have some hacks in the code to ignore the internal modelconfig. See this "super-shady hack":
https://github.com/sourcegraph/sourcegraph/blob/e5178a6bc0e0f2f5208f9dedb0c226661c881d98/cmd/frontend/internal/httpapi/completions/get_model.go#L425-L455
So we are just erring on the side of having Cody Free / Cody Pro "do whatever they do now", and this PR won't have any impact on that.
We do want Sourcegraph.com to only return this data, but there are a few things we need to get straightened out first (e.g. Cody Gateway being aware of mrefs, and having Cody clients no longer use dotcom.ts to hard-code Cody Pro LLM models).

What does this PR actually do?
- Updates cmd/cody-gateway-config so that it will produce a new "supported-models.json" file.
- Updates internal/modelconfig/embedded/models.json.

For any Sourcegraph releases after this PR gets merged, the "Sourcegraph supplied LLM models" will be the newer set defined in models.json (i.e. having these new defaults, and including "fireworks::v1::starcoder").

Test plan
I tested things locally, and unfortunately it doesn't look like any clients are filtering based on the model capabilities. So "StarCoder" is showing up in the Cody Web UI, despite failing at runtime.

Update: This was a problem on my end. This isn't an issue.
Changelog
NA?