A note on the importance of prompt and template formatting - as seen from starcoder

noneabove1182@sh.itjust.works · 11 months ago

micheal65536@lemmy.micheal65536.duckdns.org · 11 months ago

I have also encountered “rate limits” where the request is not dropped/errored out but is simply stalled until the timeout expires.

Usually this happens in a client library rather than over the network itself, though: the library blocks the thread until it knows the rate limit is due to expire before issuing the request to the server (and then blocks and reissues again if the server still returns a rate-limit error). This lets the application developer know that their request will complete “at some point” rather than having to handle the error and timeout themselves. This is usually preferred in a single-threaded application, or one where all the API calls happen on a single thread (i.e. one request at a time, with no new request issued until the previous one has completed).
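For illustration, here is a rough sketch of the blocking behaviour described above. It is not taken from any particular library: `BlockingRateLimitClient`, the `send` callable, and its `(status, retry_after_seconds, body)` return shape are all made up for the example, and it assumes the server signals a rate limit with HTTP status 429 plus a retry delay in seconds.

```python
import time


class BlockingRateLimitClient:
    """Hypothetical wrapper that hides rate limits by blocking the caller."""

    def __init__(self, send, sleep=time.sleep, clock=time.monotonic):
        # `send` is an assumed callable: payload -> (status, retry_after_seconds, body)
        self._send = send
        self._sleep = sleep
        self._clock = clock
        # Local estimate of when the current rate limit is due to expire.
        self._blocked_until = 0.0

    def request(self, payload):
        while True:
            # Block the thread until the known rate limit should have expired,
            # before the request is put on the wire at all.
            wait = self._blocked_until - self._clock()
            if wait > 0:
                self._sleep(wait)

            status, retry_after, body = self._send(payload)
            if status != 429:
                return body

            # The server still reports a rate limit: record the new expiry,
            # then loop to block and reissue the same request.
            self._blocked_until = self._clock() + retry_after
```

From the caller's side, `client.request(...)` just takes a long time to return, which looks like a stall rather than an error. A real client would probably also want a cap on total wait time or attempts, which this sketch leaves out.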