Information theory (2)

[Understanding the Paper] Compressing Context to Enhance Inference Efficiency of Large Language Models
Paper: Compressing Context to Enhance Inference Efficiency of Large Language Models
Paper link: https://arxiv.org/abs/2310.06201
Large language models (LLMs) achieved remarkable performance across various tasks. However, they face challenges in managing long documents and extended conversations, due to significantly increased co..

[Understanding the Paper] Locally Typical Sampling
Paper: Locally Typical Sampling
Paper link: https://arxiv.org/abs/2202.00666
Today's probabilistic language generators fall short when it comes to producing coherent and fluent text despite the fact that the underlying models perform well under standard metrics, e.g., perplexity. This discrepancy has puzzled the language generation
I will not go into the mathematical proofs and their interpretation. The paper's mathematical proofs and interpretation ..