* support top_k_top_p sampling
* fix
* add api param
* add api param
* fix
* fix
* fix
* fix
* fix
* fix
* fix
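
For context, combined top-k/top-p sampling can be sketched as below. This is a minimal pure-Python illustration and not the code from this change: the function names `top_k_top_p_filter` and `sample_top_k_top_p`, the parameter defaults, and the choice to apply the top-p cutoff to the original (not renormalized) probabilities after top-k are all assumptions made for the example; real implementations differ on these details.

```python
import random


def top_k_top_p_filter(probs, top_k=0, top_p=1.0):
    """Return a renormalized copy of `probs` restricted by top-k, then top-p.

    Hypothetical helper for illustration only.
    """
    # Rank token indices from most to least probable.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    # Top-k: keep only the k most probable tokens (0 disables the limit).
    if top_k > 0:
        order = order[:top_k]

    # Top-p: keep the shortest prefix whose cumulative mass reaches top_p.
    # Assumption: the cumulative sum uses the original probabilities.
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # Renormalize the surviving tokens; everything else gets probability 0.
    total = sum(probs[i] for i in kept)
    filtered = [0.0] * len(probs)
    for i in kept:
        filtered[i] = probs[i] / total
    return filtered


def sample_top_k_top_p(probs, top_k=0, top_p=1.0, rng=random):
    """Draw one token index from the filtered distribution."""
    filtered = top_k_top_p_filter(probs, top_k, top_p)
    return rng.choices(range(len(filtered)), weights=filtered, k=1)[0]
```

With `probs = [0.5, 0.3, 0.15, 0.05]`, `top_k=3`, and `top_p=0.7`, only the first two tokens survive the filter, so sampling can only return index 0 or 1.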