# In what order are top_p and temperature applied if both are specified? Is the distribution of output tokens first squished (temperature) and then truncated and renormalized (top_p), or is it first truncated and renormalized and then squished?