AMDGPU: More bits of frame index are known to be zero
author Matt Arsenault <Matthew.Arsenault@amd.com>
Sat, 27 Feb 2016 20:26:57 +0000 (20:26 +0000)
committer Matt Arsenault <Matthew.Arsenault@amd.com>
Sat, 27 Feb 2016 20:26:57 +0000 (20:26 +0000)
commit 26419a11aded07dd0b7033781b7d0e323a2019f5
tree a8dfdd161af7d5dad4e1c8cea198ac4c42cffd8a
parent cc3610dedb011741ae788c2e25c3f531a87913be

The maximum private allocation for the whole GPU is 4 GiB,
so the maximum possible frame index for a single workitem is
that maximum size divided by the smallest granularity for a dispatch.

This increases the number of high bits known to be zero, which
enables more offset folding. With this change the maximum private
size per workitem is 128 MiB, though it may be smaller still.
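
The arithmetic behind that bound can be sketched as follows. This is only an
illustration, not the code from this commit: the helper names are hypothetical,
and the dispatch granularity of 32 workitems is an assumption inferred from the
4 GiB and 128 MiB figures above (4 GiB / 128 MiB = 32). It also assumes the
frame index is treated as a 32-bit byte offset.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helpers for illustration; these names do not come from LLVM.

// Largest private size (in bytes) a single workitem can have, given the
// total private allocation and the smallest number of workitems that can
// share it in one dispatch.
uint64_t maxPrivateBytesPerWorkitem(uint64_t totalBytes,
                                    uint64_t minWorkitems) {
  return totalBytes / minWorkitems;
}

// Number of high bits of a 32-bit offset that are zero for every offset
// strictly below limitBytes. Requires 1 <= limitBytes <= 2^32.
unsigned knownZeroHighBits32(uint64_t limitBytes) {
  uint64_t maxOffset = limitBytes - 1;
  unsigned bitsNeeded = 0;
  while (maxOffset != 0) {
    ++bitsNeeded;
    maxOffset >>= 1;
  }
  return 32 - bitsNeeded;
}
```

Under these assumptions, `maxPrivateBytesPerWorkitem(1ull << 32, 32)` gives
2^27 bytes (128 MiB), and `knownZeroHighBits32(1ull << 27)` gives 5, so the
top 5 bits of a 32-bit frame index offset would be known to be zero.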

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@262153 91177308-0d34-0410-b5e6-96231b3b80d8
lib/Target/AMDGPU/AMDGPU.td
lib/Target/AMDGPU/AMDGPUSubtarget.cpp
lib/Target/AMDGPU/AMDGPUSubtarget.h
lib/Target/AMDGPU/SIISelLowering.cpp
test/CodeGen/AMDGPU/private-element-size.ll
test/CodeGen/AMDGPU/scratch-buffer.ll