Due to recent changes to support k/v caching, we need to initialize the attention class with a layer index
· Sign up or log in to comment