Large language model inference optimizations on AMD GPUs

AMD ROCm Software Blogs