ARM asm for AV_RN*()

ARMv6 and later support unaligned loads and stores for single
word/halfword but not double/multiple.  GCC is ignorant of this and
will always use bytewise accesses for unaligned data.  Casting to an
int32_t pointer is dangerous since a load/store double or multiple
instruction might be used (this happens with some code in FFmpeg).
Implementing the AV_[RW]* macros with inline asm using only supported
instructions gives fast and safe unaligned accesses.  ARM RVCT does
the right thing with generic code.

This gives an overall speedup of up to 10%.

Originally committed as revision 18601 to svn://svn.ffmpeg.org/ffmpeg/trunk
This commit is contained in:
Måns Rullgård
2009-04-18 00:00:28 +00:00
parent a6783b8961
commit 3c55ce039d
2 changed files with 81 additions and 0 deletions

View File

@ -29,6 +29,9 @@
* defined, even if these are implemented as inline functions.
*/
#if ARCH_ARM
# include "arm/intreadwrite.h"
#endif
/*
* Define AV_[RW]N helper macros to simplify definitions not provided