MAX_CONTEXT_UTILIZATION

Constant MAX_CONTEXT_UTILIZATION 

Source
pub const MAX_CONTEXT_UTILIZATION: f32 = 0.75;
Expand description

Maximum percentage of context window to use in a single request