Memory Management Techniques in Large Language Models