The first-wave models include 4 vision models: SqueezeNet, MobileNetV2, ResNetV2 and TinyYOLOV2.
To support those models, 5 ops need to be implemented: batchNormalization, clamp, concat, gemm and leakyRelu.
These ops are already supported by webnn-polyfill, so it would be good to keep the implementations aligned.
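
For reference, here is a rough sketch of what the 5 ops compute, written as plain TypeScript over number arrays. The helper names and signatures are hypothetical and are neither the WebNN API nor the webnn-polyfill code. The sketch assumes a single channel for batchNormalization, 1-D inputs for concat, no transpose options for gemm, and a default alpha of 0.01 for leakyRelu.

```typescript
// Illustrative reference semantics only. These helpers are hypothetical and
// are NOT the WebNN or webnn-polyfill API.

// batchNormalization (single channel): scale * (x - mean) / sqrt(variance + epsilon) + bias
function batchNormalization(
  x: number[], mean: number, variance: number,
  scale = 1, bias = 0, epsilon = 1e-5
): number[] {
  const denom = Math.sqrt(variance + epsilon);
  return x.map((v) => (scale * (v - mean)) / denom + bias);
}

// clamp: limit every element to the range [minValue, maxValue].
function clamp(x: number[], minValue: number, maxValue: number): number[] {
  return x.map((v) => Math.min(Math.max(v, minValue), maxValue));
}

// concat (1-D case): join the inputs end to end, in order.
function concat(inputs: number[][]): number[] {
  return inputs.reduce((acc, cur) => acc.concat(cur), [] as number[]);
}

// gemm (no transposes): alpha * (A x B) + beta * C, with C optional.
function gemm(
  a: number[][], b: number[][], c?: number[][], alpha = 1, beta = 1
): number[][] {
  const rows = a.length;
  const inner = b.length;
  const cols = b[0].length;
  const out: number[][] = [];
  for (let i = 0; i < rows; i++) {
    const row: number[] = [];
    for (let j = 0; j < cols; j++) {
      let sum = 0;
      for (let k = 0; k < inner; k++) sum += a[i][k] * b[k][j];
      row.push(alpha * sum + (c ? beta * c[i][j] : 0));
    }
    out.push(row);
  }
  return out;
}

// leakyRelu: keep non-negative values, scale negative values by alpha.
function leakyRelu(x: number[], alpha = 0.01): number[] {
  return x.map((v) => (v >= 0 ? v : alpha * v));
}
```

For example, `clamp(x, 0, 6)` gives the ReLU6 activation used in MobileNetV2. A real implementation would also need to handle N-D tensors, the axis arguments, and broadcasting, all of which this sketch leaves out.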