currently only the sync vllm worker has nvtx markers. this issue tracks adding this to the async llm engine for parity
currently only the sync vllm worker has nvtx markers. this issue tracks adding this to the async llm engine for parity