git.osdn.net Git - android-x86/external-llvm.git/commit

author	Ayman Musa <ayman.musa@intel.com>
	Tue, 27 Jun 2017 12:08:37 +0000 (12:08 +0000)
committer	Ayman Musa <ayman.musa@intel.com>
	Tue, 27 Jun 2017 12:08:37 +0000 (12:08 +0000)
commit	ae1022198b4de754c4e839a2429fd77acdfbfbca
tree	9f4a4b6696f6f9b6626862afeec215d4d2c704e2	tree \| snapshot
parent	d954633ce298f8c2a15419d42e2697e634af52df	commit \| diff

Recommitting rL305465 after fixing bug in TableGen in rL306251 & rL306371

[X86][AVX512] Improve lowering of AVX512 compare intrinsics (remove redundant shift left+right instructions).

AVX512 compare instructions return v*i1 types.
In cases where the number of elements in the returned value are less than 8, clang adds zeroes to get a mask of v8i1 type.
Later on it's replaced with CONCAT_VECTORS, which then is lowered to many DAG nodes including insert/extract element and shift right/left nodes.
The fact that AVX512 compare instructions put the result in a k register and zeroes all its upper bits allows us to remove the extra nodes simply by copying the result to the required register class.

When lowering, identify these cases and transform them into an INSERT_SUBVECTOR node (marked legal), then catch this pattern in instructions selection phase and transform it into one avx512 cmp instruction.

Differential Revision: https://reviews.llvm.org/D33188

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@306402 91177308-0d34-0410-b5e6-96231b3b80d8

lib/Target/X86/X86ISelLowering.cpp		diff \| blob \| history
lib/Target/X86/X86InstrAVX512.td		diff \| blob \| history
test/CodeGen/X86/avx512vl-intrinsics-upgrade.ll		diff \| blob \| history
test/CodeGen/X86/avx512vl-vec-masked-cmp.ll	[new file with mode: 0644]	blob
test/CodeGen/X86/compress_expand.ll		diff \| blob \| history
test/CodeGen/X86/masked_memop.ll		diff \| blob \| history