Class to handle a large language model on top of onnxruntime
Generate tokens using greedy search.
Parameters: initial tokens; callback function to handle the generated tokens; generation options.
Returns: array of generated tokens.
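As an illustration of what greedy search means here, the following is a minimal sketch, not this class's actual implementation. Every name in it (`greedyGenerate`, `LogitsFn`, `GenerateOptions`, `maxTokens`, `eosTokenId`) is hypothetical, and `nextLogits` stands in for whatever model call produces the next-token logits. It assumes the callback receives the tokens generated so far and that the returned array excludes the initial tokens. The sketch is synchronous for brevity; a real onnxruntime inference call would typically be awaited.

```typescript
// Hypothetical stand-in for one forward pass of the model:
// takes the token sequence so far, returns logits over the vocabulary.
type LogitsFn = (tokens: number[]) => Float32Array;

// Assumed option names, chosen for illustration only.
interface GenerateOptions {
  maxTokens: number;  // upper bound on the number of newly generated tokens
  eosTokenId: number; // token id that ends generation
}

// Index of the largest value (first one wins on ties).
function argmax(values: ArrayLike<number>): number {
  let best = 0;
  for (let i = 1; i < values.length; i++) {
    if (values[i] > values[best]) best = i;
  }
  return best;
}

function greedyGenerate(
  initialTokens: number[],
  nextLogits: LogitsFn,
  callback: ((generated: number[]) => void) | undefined,
  options: GenerateOptions,
): number[] {
  const tokens = [...initialTokens];
  for (let step = 0; step < options.maxTokens; step++) {
    // Greedy search: always pick the single most likely next token.
    const next = argmax(nextLogits(tokens));
    tokens.push(next);
    // Report progress with the tokens generated so far (assumed callback contract).
    if (callback) callback(tokens.slice(initialTokens.length));
    if (next === options.eosTokenId) break;
  }
  // Assumption: only the newly generated tokens are returned.
  return tokens.slice(initialTokens.length);
}
```

Because greedy search takes the argmax at every step, the output is deterministic for a given model and prompt; sampling strategies such as top-k or temperature sampling would replace only the `argmax` call.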