Re: LLM Experiments, Part 1: Corrections
From: Andrew Hyatt
Subject: Re: LLM Experiments, Part 1: Corrections
Date: Mon, 22 Jan 2024 16:21:26 -0400
User-agent: Gnus/5.13 (Gnus v5.13)
On 22 January 2024 21:57, Psionic K <psionik@positron.solutions> wrote:
>> I think things have to be synchronous here.
> Snapshot isolation is the best strategy for merging here. We don't
> know what user commands affected the region in question, so using
> undo states to merge might need to undo really arbitrary user
> commands. To snapshot isolate, basically you store a copy of the
> buffer text and hold two markers where that text was. You can
> merge the result if it arrives on time, and then diff the snapshot
> with the buffer text between the markers. If things are too
> different for a valid merge, you can give up and drop the results.
> These days various CRDT (conflict-free replicated data type)
> treatments have great insights into dealing with much worse
> problems of multiple asynchronous writers, and it's a good place
> to look. There is a crdt.el package for some inspiration.
This is a good tip, thank you.
> But definitely not synchronous.
I think the text changing out from under you is just one problem to
solve. The other is that once we start some LLM-powered workflow, the
user is free to do whatever they want for, say, ten seconds while it
runs. That requires more from both us and the user: we would need to
communicate to the user that something is awaiting their input, and
the user would then need to run a command to return to the experience
we want to put them in (in this case, an ediff session). It's a bit
weird, and I think a bit too complicated. I'm still leaning to the
synchronous side, but it's worth trying out an async solution and
seeing just how bad it is.
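For concreteness, the snapshot-isolation check described above could
be sketched like this. This is only a Python sketch of the idea
(`try_merge` and the 0.8 threshold are hypothetical); a real Emacs
implementation would hold markers and compare buffer text between
them.

```python
import difflib

def try_merge(snapshot, current_region, llm_result, threshold=0.8):
    """Decide what to do with an LLM result that arrives asynchronously.

    snapshot       -- the region's text when the call was made
    current_region -- the text now between the two markers
    llm_result     -- what the model produced for the snapshot
    """
    if current_region == snapshot:
        return llm_result  # nothing changed underneath us; apply directly
    similarity = difflib.SequenceMatcher(None, snapshot, current_region).ratio()
    if similarity < threshold:
        return None  # too different for a valid merge; give up, drop result
    # The region drifted, but not too far: surface both versions for an
    # ediff-style manual resolution instead of silently overwriting.
    return ("conflict", current_region, llm_result)
```

The `None` return is the "give up and drop the results" branch; what
counts as "too different" is a policy knob the client would tune.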
> As a package author, I would want to treat my LLM like a fancy
> process. I create it, I handle results. I have a merging strategy
> (this is mainly up to the client, not the library), but I don't
> care about the asynchronous details and I don't want to be tied to
> each call.
The llm library does work like this already: it has async methods
that take callbacks. This thread is about higher-level functionality.
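That "fancy process" shape, reduced to a sketch (all names here are
hypothetical; the real llm package is Emacs Lisp, and its async
functions take callbacks in a similar spirit):

```python
from concurrent.futures import ThreadPoolExecutor

class LlmProcess:
    """Hypothetical wrapper: create the 'process', hand in result and
    error handlers, and never touch the asynchronous details."""

    def __init__(self, backend):
        self._backend = backend  # callable: prompt -> response
        self._pool = ThreadPoolExecutor(max_workers=1)

    def call(self, prompt, on_result, on_error=lambda e: None):
        def run():
            try:
                on_result(self._backend(prompt))
            except Exception as e:
                on_error(e)
        self._pool.submit(run)

    def wait(self):
        # Block until all submitted calls have completed.
        self._pool.shutdown(wait=True)
```

The package author only writes `on_result`; whether the backend is a
subprocess, an HTTP request, or a local model is hidden behind it.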
>> Question 6
> A rock solid library that sticks to the domain model is best for
> ecosystem growth. When that doesn't happen, we get four or five
> 75%-finished packages, because every author is having to figure
> out and integrate their high-level features with so many backends.
> If you want to work on high-level things, build a client for your
> library and experience both sides.
Totally agree, and that's what I've started and will continue to
do with these demos, which inform the development of the llm-flows
layer I'm building.
> Every model will have some mostly static configuration, dynamic
> arguments that could change all the time but in practice change
> just a few times, and then the input data. The static
> configuration, if absolutely necessary, can be updated for one
> call via dynamic binding. The dynamic arguments should be
> abstracted into a "context" object that the model backend figures
> out how to translate into a valid API call. The input data is an
> arbitrary bundle of whatever that model type consumes as input.
> The library user will want to get a valid context of the dynamic
> arguments from the library, enabling them to make changes to it in
> subsequent calls, but they don't really want to touch it that
> much. As a package author, I would want to focus on integrating
> outputs and piping in inputs. I don't want to write a UI for
> tuning the model parameters. If the model can ask the user to
> make adjustments and just give me a record of their decision I can
> use later, that would be fantastic. I should be able to integrate
> more closely with backends I know about, but otherwise just call
> with the provided context and my inputs.
Agreed, such adjustments should be part of a common layer.
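The three-way split described above (static configuration, a dynamic
"context" object, input data) might look roughly like this. This is a
Python sketch under my own naming; `Context`, `refine`, and
`call_model` are all hypothetical, not part of the llm package.

```python
from dataclasses import dataclass, replace

@dataclass(frozen=True)
class Context:
    """The dynamic arguments a backend knows how to translate into a
    valid API call. Immutable; refinements produce a new context."""
    model: str
    temperature: float = 1.0
    max_tokens: int = 1024

    def refine(self, **changes):
        # Tweak a few arguments for one call without mutating the
        # original, so earlier contexts stay valid for later calls.
        return replace(self, **changes)

def call_model(context, input_data):
    # Static configuration (API keys, endpoints) would live in the
    # provider; a real backend would translate `context` into
    # provider-specific request parameters here.
    return {"context": context, "input": input_data}
```

The point of the frozen dataclass is exactly the "record of their
decision I can use later": a context is a value you can store, reuse,
and refine per call.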
> Providers offer multiple models. As a library user, it's
> inconvenient if I have to go through long incantations to get each
> context that represents the capability to make valid calls for the
> provider. I want to initialize once and then use an existing
> context to pull out the correct context based on the input or
> output type I need, and then make refinements that are specific to
> a call, such as changing quality or entropy etc. Input or output
> type and settings that tune the call are two different things.
> Settings are mostly provider-specific argument data that doesn't
> affect the validity of connecting one model to another. Input and
> output type affect which pipes can be connected to which other
> pipes. This distinction between input or output types and other
> arguments becomes important in composition. I should be able to
> connect any string-to-string model with any other model that
> handles strings, no matter what the other settings are.
I think we're on the same page here. Anything for quality tuning
should be generic, I hope: perhaps a knob for the quality-to-price
tradeoff that can be used for many things, including deciding how
much context to provide. The rest is already generic in the llm
package.
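The composition rule above — input/output types govern which pipes
connect, settings don't — can be sketched directly. These names
(`Model`, `connect`) are my own illustration, not an existing API.

```python
from dataclasses import dataclass, field
from typing import Callable

@dataclass
class Model:
    """Hypothetical model handle: in/out types govern which pipes can
    be connected; `settings` is provider data that never affects
    connection validity."""
    in_type: type
    out_type: type
    fn: Callable
    settings: dict = field(default_factory=dict)

def connect(a, b):
    """Compose two models. Legal whenever a's output type matches b's
    input type, no matter what either model's settings are."""
    if a.out_type is not b.in_type:
        raise TypeError(
            f"cannot pipe {a.out_type.__name__} into {b.in_type.__name__}")
    return Model(a.in_type, b.out_type, lambda x: b.fn(a.fn(x)))
```

So any string-to-string model composes with any other model that
consumes strings, while a string-to-embedding model refuses to feed a
string consumer, regardless of temperature, quality, or any other
setting.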
> Integrating these systems will be more like distributed streaming
> programming than feeding inputs to a GPU with tight
> synchronization and everything under our watch, although a local
> model might work that way inside its own box. We should treat
> them like unreliable external services. A call to the model is a
> command. When I send a command, I should store how to handle the
> reply, but I shouldn't couple myself to it with nested callbacks
> or async, which we fortunately don't have anyway. The call data
> just goes into a pile. If the reply shows up and it matches a
> call, we handle it. If things time out, we dead-letter and drop
> the record of making a call. This is a very good way to get
> around the limitations of the process as our main asynchronous
> primitive for now. It works for big distributed services, which
> by their very nature cannot lock each other or share memory. It
> will work for connecting many models to each other.
I'm not sure I understand this part. Yes, we can have a system that
stores callbacks in some hashmap or similar, and that's better than
tying them directly to a specific process. However, something must
always track when the process is done or has timed out, and that
something is the process itself. I'm not sure how the centralized
storage reduces the coupling to the process. But if I'm reading this
correctly, it seems like an argument for using state machines, with
the centralized storage acting as a driver for state changes, which
may be a good way to think about this.
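The "pile of calls" with reply matching and dead-lettering could be
sketched as follows; the `CallRegistry` name, the ten-second default,
and the explicit `now` parameter (to make expiry testable) are all my
own choices for illustration.

```python
import time
import uuid

class CallRegistry:
    """Commands go into a pile; replies are matched by call id.
    Calls whose deadline passes are dead-lettered and dropped."""

    def __init__(self, timeout=10.0):
        self.pending = {}       # call-id -> (deadline, reply handler)
        self.dead_letters = []  # ids of calls that timed out
        self.timeout = timeout

    def send(self, handler, now=None):
        # Record how to handle the reply; no callback nesting, no
        # coupling to whichever process services the call.
        call_id = uuid.uuid4().hex
        now = time.monotonic() if now is None else now
        self.pending[call_id] = (now + self.timeout, handler)
        return call_id

    def on_reply(self, call_id, reply):
        entry = self.pending.pop(call_id, None)
        if entry is not None:  # reply matches a live call: handle it
            _, handler = entry
            handler(reply)
        # else: reply for an expired or unknown call; ignore it

    def expire(self, now=None):
        now = time.monotonic() if now is None else now
        for call_id, (deadline, _) in list(self.pending.items()):
            if now > deadline:
                self.dead_letters.append(call_id)
                del self.pending[call_id]
```

Read as a state machine, each call moves through pending → handled or
pending → dead-lettered, and the registry is the driver of those
transitions; whatever watches the underlying process only has to call
`on_reply` and `expire`.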
Thank you for your thorough and thoughtful response!