The MSA optimization has been refined in commit93218c2andce0a52e. It is better than MMI version now. Speed of decoding H264: 4.83x ==> 4.89x (tested on 3A4000). Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
2.0 KiB
2.0 KiB