summaryrefslogtreecommitdiffstats
path: root/scripts/module-common.lds
diff options
context:
space:
mode:
authorArnd Bergmann <arnd@arndb.de>2016-03-16 17:39:17 +0100
committerMartin K. Petersen <martin.petersen@oracle.com>2016-03-18 20:29:58 +0100
commitef3fb2422ffe92ead0579c21a02095b329434900 (patch)
tree7ec167e9ce2e8681ec3d412ddcbbfffc64a8531f /scripts/module-common.lds
parentfnic: move printk()s outside of the critical code section. (diff)
downloadlinux-ef3fb2422ffe92ead0579c21a02095b329434900.tar.xz
linux-ef3fb2422ffe92ead0579c21a02095b329434900.zip
scsi: fc: use get/put_unaligned64 for wwn access
A bug in the gcc-6.0 prerelease version caused at least one driver (lpfc) to have excessive stack usage when dealing with wwn data, on the ARM architecture. lpfc_scsi.c: In function 'lpfc_find_next_oas_lun': lpfc_scsi.c:117:1: warning: the frame size of 1152 bytes is larger than 1024 bytes [-Wframe-larger-than=] I have reported this as a gcc regression in https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70232 However, using a better implementation of wwn_to_u64() not only helps with the particular gcc problem but also leads to better object code for any version or architecture. The kernel already provides get_unaligned_be64() and put_unaligned_be64() helper functions that provide an optimized implementation with the desired semantics. The lpfc_find_next_oas_lun() function in the example that grew from 1146 bytes to 5144 bytes when moving from gcc-5.3 to gcc-6.0 is now 804 bytes, as the optimized get_unaligned_be64() load can be done in three instructions. The stack usage is now down to 28 bytes from 128 bytes with gcc-5.3 before. Signed-off-by: Arnd Bergmann <arnd@arndb.de> Reviewed-by: Hannes Reinicke <hare@suse.de> Reviewed-by: Ewan Milne <emilne@redhat.com> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
Diffstat (limited to 'scripts/module-common.lds')
0 files changed, 0 insertions, 0 deletions