* SYCL: refactor and move cpy kernels to a separate file * Add few missing cpy kernels * refactor and add debug logs