ARM asm for AV_RN*()

ARMv6 and later support unaligned loads and stores for single word/halfword but not double/multiple. GCC is ignorant of this and will always use bytewise accesses for unaligned data. Casting to an int32_t pointer is dangerous since a load/store double or multiple instruction might be used (this happens with some code in FFmpeg). Implementing the AV_[RW]* macros with inline asm using only supported instructions gives fast and safe unaligned accesses. ARM RVCT does the right thing with generic code. This gives an overall speedup of up to 10%. Originally committed as revision 18601 to svn://svn.ffmpeg.org/ffmpeg/trunk
2009-04-18 00:00:28 +00:00
parent a6783b8961
commit 3c55ce039d
2 changed files with 81 additions and 0 deletions
--- a/libavutil/intreadwrite.h
+++ b/libavutil/intreadwrite.h
@ -29,6 +29,9 @@
 * defined, even if these are implemented as inline functions.
 */

+#if   ARCH_ARM
+#   include "arm/intreadwrite.h"
+#endif

 /*
 * Define AV_[RW]N helper macros to simplify definitions not provided