Martin Storsjö
717cc82d28
aarch64: hevc: Specialize put_hevc_\type\()_h*_8_neon for horizontal looping
...
For widths of 32 pixels and more, loop first horizontally,
then vertically.
Previously, this function would process a 16 pixel wide slice
of the block, looping vertically. After processing the whole
height, it would backtrack and process the next 16 pixel wide
slice.
When doing 8tap filtering horizontally, the function must load
7 more pixels (in practice, 8) following the actual inputs, and
this was done for each slice.
By iterating first horizontally throughout each line, then
vertically, we access data in a more cache friendly order, and
we don't need to reload data unnecessarily.
Keep the original order in put_hevc_\type\()_h12_8_neon; the
only suboptimal case there is for width=24. But specializing
an optimal variant for that would require more code, which
might not be worth it.
For the h16 case, this implementation would give a slowdown,
as it now loads the first 8 pixels separately from the rest, but
for larger widths, it is a gain. Therefore, keep the h16 case
as it was (but remove the outer loop), and create a new specialized
version for horizontal looping with 16 pixels at a time.
Before: Cortex A53 A72 A73 Graviton 3
put_hevc_qpel_h16_8_neon: 710.5 667.7 692.5 211.0
put_hevc_qpel_h32_8_neon: 2791.5 2643.5 2732.0 883.5
put_hevc_qpel_h64_8_neon: 10954.0 10657.0 10874.2 3241.5
After:
put_hevc_qpel_h16_8_neon: 697.5 663.5 705.7 212.5
put_hevc_qpel_h32_8_neon: 2767.2 2684.5 2791.2 920.5
put_hevc_qpel_h64_8_neon: 10559.2 10471.5 10932.2 3051.7
Signed-off-by: Martin Storsjö <martin@martin.st >
2024-03-26 08:58:11 +02:00
..
2024-03-26 08:58:11 +02:00
2024-03-14 01:37:31 +01:00
2024-03-15 12:51:15 +01:00
2024-03-18 01:49:38 +01:00
2024-03-10 23:45:51 +02:00
2024-03-14 22:20:00 +01:00
2024-03-02 02:54:11 +01:00
2024-03-15 15:31:58 +01:00
2024-03-20 22:27:19 +08:00
2024-03-25 21:17:47 +01:00
2024-03-10 19:49:03 +01:00
2024-03-07 08:53:30 -03:00
2024-03-04 02:11:38 +01:00
2024-02-27 00:11:21 +01:00
2024-03-02 02:54:11 +01:00
2024-03-02 02:54:11 +01:00
2024-03-02 02:54:11 +01:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:31 -03:00
2024-03-07 08:53:30 -03:00
2024-03-04 02:11:50 +01:00
2024-03-07 08:53:30 -03:00
2024-03-02 02:54:11 +01:00
2024-03-04 02:11:53 +01:00
2024-03-02 02:54:11 +01:00
2024-03-04 02:11:53 +01:00
2024-03-02 02:54:11 +01:00
2024-03-18 01:49:38 +01:00
2024-03-18 01:49:38 +01:00
2024-03-02 02:54:11 +01:00
2024-02-27 00:11:21 +01:00
2024-02-27 00:11:21 +01:00
2024-02-27 00:11:21 +01:00
2024-03-15 12:51:15 +01:00
2024-02-27 00:11:21 +01:00
2024-02-27 00:11:21 +01:00
2024-02-27 00:11:21 +01:00
2024-02-27 00:11:21 +01:00
2024-03-07 08:53:31 -03:00
2024-03-04 02:11:53 +01:00
2024-02-14 14:53:41 +01:00
2024-03-07 08:53:31 -03:00
2024-03-07 08:53:31 -03:00
2024-03-07 08:53:31 -03:00
2024-02-12 00:50:46 +01:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-03-17 15:10:05 +01:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-02-14 14:53:41 +01:00
2024-02-14 14:53:41 +01:00
2024-02-14 14:53:41 +01:00
2024-03-23 18:55:21 +01:00
2024-03-23 18:55:21 +01:00
2024-03-23 18:55:21 +01:00
2024-02-14 14:53:41 +01:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-02-22 01:45:31 +01:00
2024-02-21 00:24:44 +01:00
2024-03-07 08:53:30 -03:00
2024-03-25 08:54:18 +01:00
2024-03-23 18:54:36 +01:00
2024-03-11 11:40:09 +01:00
2024-02-09 16:14:56 +01:00
2024-03-07 08:53:30 -03:00
2024-03-20 19:15:05 +02:00
2024-02-14 14:53:41 +01:00
2024-02-09 23:20:29 +01:00
2024-03-15 12:51:15 +01:00
2024-03-15 12:51:15 +01:00
2024-03-02 02:47:50 +01:00
2024-02-12 22:00:03 +02:00
2024-02-12 22:00:03 +02:00
2024-03-25 08:54:18 +01:00
2024-03-25 08:32:04 +01:00
2024-02-14 14:53:41 +01:00
2024-02-25 13:42:47 +01:00
2024-02-25 13:42:47 +01:00
2024-03-23 22:33:21 +08:00
2024-02-25 13:42:47 +01:00
2024-02-25 13:42:47 +01:00
2024-02-25 13:42:47 +01:00
2024-02-22 01:42:28 +01:00
2024-03-18 01:49:38 +01:00
2024-02-14 14:53:41 +01:00
2024-03-04 02:11:38 +01:00
2024-03-07 08:53:29 -03:00
2024-03-07 08:53:29 -03:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:31 -03:00
2024-03-07 08:53:30 -03:00
2024-03-08 07:37:56 +01:00
2024-02-14 14:53:41 +01:00
2024-02-09 23:11:52 +01:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-02-18 07:30:01 +01:00
2024-03-22 20:26:53 +08:00
2024-03-08 07:37:56 +01:00
2024-03-17 15:10:05 +01:00
2024-03-07 08:53:30 -03:00
2024-03-15 12:51:15 +01:00
2024-03-15 12:51:15 +01:00
2024-03-07 08:53:31 -03:00
2024-03-22 14:05:30 +01:00
2024-03-08 07:37:56 +01:00
2024-03-15 12:51:15 +01:00
2024-02-18 07:33:27 +01:00
2024-02-11 00:40:06 +01:00
2024-02-08 10:34:02 +08:00
2024-02-27 23:05:26 +01:00
2024-03-07 08:53:30 -03:00
2024-03-15 12:51:15 +01:00
2024-03-07 08:53:30 -03:00
2024-02-21 00:24:44 +01:00
2024-02-26 16:28:37 +08:00
2024-03-04 17:45:00 +01:00
2024-02-14 14:53:41 +01:00
2024-02-14 14:53:41 +01:00
2024-03-23 11:37:44 +01:00
2024-03-23 11:37:44 +01:00
2024-02-21 00:24:44 +01:00
2024-02-14 14:53:41 +01:00
2024-02-21 00:24:44 +01:00
2024-02-09 08:16:25 +01:00
2024-02-09 08:16:25 +01:00
2024-02-09 16:14:56 +01:00
2024-03-07 08:53:30 -03:00
2024-03-04 17:45:12 +01:00
2024-03-04 17:45:12 +01:00
2024-03-25 20:44:30 +00:00
2024-02-21 00:24:44 +01:00
2024-02-09 08:16:25 +01:00
2024-02-09 08:16:25 +01:00
2024-03-10 13:51:37 +01:00
2024-03-08 07:37:56 +01:00
2024-02-22 01:42:28 +01:00
2024-03-15 12:51:15 +01:00
2024-02-18 07:33:27 +01:00
2024-03-23 18:55:21 +01:00
2024-03-23 18:55:21 +01:00
2024-02-18 07:30:01 +01:00
2024-02-18 07:30:01 +01:00
2024-03-13 23:29:02 +01:00
2024-03-21 11:31:32 -03:00
2024-03-21 11:31:32 -03:00
2024-03-23 18:55:21 +01:00
2024-03-13 23:29:02 +01:00
2024-02-13 20:25:49 -03:00
2024-02-13 20:25:49 -03:00
2024-02-13 20:25:49 -03:00
2024-03-07 09:15:18 +01:00
2024-03-07 09:15:18 +01:00
2024-02-21 00:24:44 +01:00
2024-02-10 00:04:16 -03:00
2024-03-15 12:51:15 +01:00
2024-03-15 12:51:15 +01:00
2024-02-12 00:50:47 +01:00
2024-02-09 08:16:25 +01:00
2024-03-25 20:44:30 +00:00
2024-02-29 10:30:16 +01:00
2024-03-19 09:44:59 -03:00
2024-03-04 17:39:25 +01:00
2024-02-21 00:24:44 +01:00
2024-03-23 11:34:34 +01:00
2024-03-23 11:34:34 +01:00
2024-02-20 00:09:05 +01:00
2024-02-20 00:09:05 +01:00
2024-02-21 00:24:44 +01:00
2024-03-07 08:53:31 -03:00
2024-03-07 08:53:31 -03:00
2024-02-20 17:31:40 +11:00
2024-02-14 14:53:41 +01:00
2024-02-14 14:53:41 +01:00
2024-02-14 14:53:41 +01:00
2024-03-07 08:53:30 -03:00
2024-03-23 18:54:36 +01:00
2024-02-09 08:16:25 +01:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-03-08 07:37:56 +01:00
2024-03-12 17:36:05 -04:00
2024-03-07 08:53:30 -03:00
2024-02-14 14:53:41 +01:00
2024-02-14 14:53:41 +01:00
2024-02-14 14:53:41 +01:00
2024-03-22 17:04:05 +01:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-03-22 23:38:16 +01:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-02-14 14:53:41 +01:00
2024-03-22 23:38:16 +01:00
2024-03-22 23:38:16 +01:00
2024-02-14 14:53:41 +01:00
2024-03-01 01:34:26 +01:00
2024-03-01 01:34:26 +01:00
2024-02-21 00:24:44 +01:00
2024-02-14 14:53:41 +01:00
2024-02-21 00:24:44 +01:00
2024-03-14 21:22:45 +01:00
2024-03-04 02:11:38 +01:00
2024-03-04 02:11:38 +01:00
2024-03-04 02:11:38 +01:00
2024-02-14 14:53:41 +01:00
2024-03-25 08:54:40 +01:00
2024-02-21 20:08:25 +02:00
2024-02-21 20:08:25 +02:00
2024-03-23 11:37:44 +01:00
2024-03-05 21:25:56 +08:00
2024-02-14 14:53:41 +01:00
2024-03-12 14:37:49 +01:00
2024-02-14 14:53:41 +01:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-02-09 08:16:25 +01:00
2024-02-22 01:45:31 +01:00
2024-02-09 08:16:25 +01:00
2024-03-14 21:52:16 +01:00
2024-03-19 15:52:05 +01:00
2024-02-14 14:53:41 +01:00
2024-03-17 15:10:05 +01:00
2024-03-15 12:51:15 +01:00
2024-03-17 15:10:05 +01:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-03-26 06:30:45 +01:00
2024-03-07 08:53:30 -03:00
2024-02-14 14:53:41 +01:00
2024-02-09 08:16:25 +01:00
2024-02-21 00:24:44 +01:00
2024-03-01 01:35:42 +01:00
2024-03-01 01:35:42 +01:00
2024-02-21 00:24:44 +01:00
2024-02-08 14:00:37 +01:00
2024-02-08 14:00:37 +01:00
2024-02-08 14:00:37 +01:00
2024-02-08 14:00:37 +01:00
2024-02-08 14:00:37 +01:00
2024-02-14 14:53:41 +01:00
2024-02-14 14:53:41 +01:00
2024-02-14 14:53:41 +01:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-02-14 14:53:41 +01:00
2024-03-08 07:37:55 +01:00
2024-03-20 19:15:05 +02:00
2024-03-15 12:51:15 +01:00
2024-03-07 08:53:30 -03:00
2024-03-26 01:19:17 +01:00
2024-03-07 08:53:30 -03:00
2024-03-13 00:52:30 +01:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-03-15 12:51:15 +01:00
2024-03-15 12:51:15 +01:00
2024-03-08 07:38:38 +01:00
2024-02-21 00:24:44 +01:00
2024-02-14 14:53:41 +01:00
2024-02-21 00:24:44 +01:00
2024-03-10 19:49:05 +01:00
2024-03-07 08:53:30 -03:00
2024-03-18 12:00:14 +08:00
2024-02-29 13:36:41 +08:00
2024-02-29 13:36:41 +08:00
2024-02-29 13:36:41 +08:00
2024-02-29 13:36:41 +08:00
2024-02-29 13:36:41 +08:00
2024-02-29 13:36:41 +08:00
2024-02-14 14:53:41 +01:00
2024-03-07 08:53:30 -03:00
2024-03-04 02:11:38 +01:00
2024-03-01 01:35:42 +01:00
2024-02-09 08:16:25 +01:00
2024-02-09 08:16:25 +01:00
2024-02-09 08:16:25 +01:00
2024-02-17 14:33:35 +02:00
2024-02-17 14:33:35 +02:00
2024-02-09 08:16:25 +01:00
2024-02-14 14:53:41 +01:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-02-27 00:11:21 +01:00
2024-02-27 00:11:21 +01:00
2024-02-21 00:24:44 +01:00
2024-02-17 09:51:23 -03:00
2024-02-22 01:45:31 +01:00
2024-02-09 08:16:25 +01:00
2024-03-02 02:54:11 +01:00
2024-03-02 02:54:11 +01:00
2024-02-09 08:16:25 +01:00
2024-02-14 13:18:42 +01:00
2024-02-18 07:30:01 +01:00
2024-02-08 20:36:04 +01:00
2024-02-18 07:30:01 +01:00
2024-03-14 20:37:03 +01:00
2024-03-14 20:37:03 +01:00
2024-03-14 20:37:03 +01:00
2024-03-12 14:37:49 +01:00
2024-02-22 01:45:31 +01:00
2024-03-15 12:51:15 +01:00
2024-02-14 14:53:41 +01:00
2024-03-07 08:53:29 -03:00
2024-03-07 08:53:29 -03:00
2024-03-18 12:00:14 +08:00
2024-02-14 14:53:41 +01:00
2024-02-14 14:53:41 +01:00
2024-02-14 14:53:41 +01:00
2024-02-14 14:53:41 +01:00
2024-03-18 12:00:14 +08:00
2024-03-18 12:00:14 +08:00
2024-02-18 07:30:01 +01:00
2024-02-18 07:30:01 +01:00
2024-02-11 11:15:15 +01:00
2024-02-14 14:53:41 +01:00
2024-03-08 00:58:40 +01:00
2024-03-08 00:58:40 +01:00
2024-03-08 00:58:40 +01:00
2024-03-20 19:15:05 +02:00
2024-03-07 08:53:30 -03:00
2024-03-07 08:53:30 -03:00
2024-02-14 14:53:41 +01:00
2024-03-25 21:41:26 +01:00
2024-03-07 08:53:30 -03:00
2024-03-17 15:10:05 +01:00
2024-03-10 19:49:04 +01:00
2024-02-17 14:45:49 +02:00
2024-02-17 14:45:49 +02:00
2024-03-25 08:54:40 +01:00
2024-03-25 08:54:40 +01:00
2024-03-25 08:54:40 +01:00
2024-03-07 09:00:39 +01:00
2024-03-21 11:31:32 -03:00
2024-03-07 09:00:30 +01:00
2024-03-25 08:54:40 +01:00
2024-03-23 22:49:29 +08:00
2024-02-25 18:30:34 -03:00
2024-03-08 07:37:56 +01:00
2024-02-22 01:45:31 +01:00
2024-02-09 08:16:25 +01:00
2024-03-10 13:51:37 +01:00
2024-03-10 13:50:26 +01:00
2024-02-09 08:16:25 +01:00
2024-03-10 13:51:37 +01:00