Java.lang.outofmemoryerror GC Overhead Limit

[Performance]: Use int over list[int] as output_tokens to reduce GC overhead

Currently, we're consistently using list[int] to represent output_tokens in ModelRunnerOutput which is very inefficient from GC prospective. The default setup of GC is (700, 10, 10) which means if ...

Cuireadh roinnt torthaí i bhfolach toisc go bhféadfadh siad a bheith dorochtana duit

Taispeáin torthaí dorochtana

Aiseolas

[Performance]: Use int over list[int] as output_tokens to reduce GC overhead

Ag Treochtáil anois