OSDN Git Service
Jonas Devlieghere [Sat, 23 Jun 2018 19:04:10 +0000 (19:04 +0000)]
[llc] Fix sanitizer failure.
Store the string on the stack rather than just the StringRef.
Fix sanitizer bots:
http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/19948
http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap-msan/builds/5500
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335423
91177308-0d34-0410-b5e6-
96231b3b80d8
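Note: the failure class fixed here is a non-owning string view outliving the storage it points at. A minimal sketch of the safe pattern follows, using std::string_view as a stand-in for llvm::StringRef; makeName, ownedName and viewLength are hypothetical names, not code from llc.

```cpp
#include <cstddef>
#include <string>
#include <string_view>

// Hypothetical helper that returns a temporary string.
std::string makeName(int id) { return "target-" + std::to_string(id); }

// Unsafe pattern (not executed here): binding a view directly to the
// temporary, e.g. `std::string_view v = makeName(1);`, leaves `v` dangling
// once the temporary is destroyed at the end of the statement.

// Safe pattern: keep the string itself alive on the stack.
std::string ownedName(int id) {
  std::string name = makeName(id); // the storage lives in `name`
  return name;
}

// A view is fine as long as the owning string outlives it.
std::size_t viewLength(int id) {
  std::string storage = makeName(id); // owner on the stack
  std::string_view view = storage;    // valid while `storage` is in scope
  return view.size();
}
```

For example, ownedName(7) yields "target-7" and viewLength(42) yields the length of "target-42".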
Duncan P. N. Exon Smith [Sat, 23 Jun 2018 18:39:44 +0000 (18:39 +0000)]
ADT: Use EBO to shrink SmallVector size 1
SmallVectorStorage is empty when its size is 1; use inheritance so that
the empty base class optimization kicks in.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335421
91177308-0d34-0410-b5e6-
96231b3b80d8
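Note: the empty base class optimization can be illustrated with a small sketch. The types below are hypothetical stand-ins, not the actual SmallVector classes; the point is only that an empty class takes space as a member but none as a base.

```cpp
#include <cstddef>

// Stand-in for an inline-storage class that has nothing to store.
struct EmptyStorage {};

// Stand-in for the bookkeeping every vector object carries.
struct Header {
  void *Begin;
  std::size_t Size;
};

// As a member, the empty object still needs its own address, so it costs
// at least one byte plus padding.
struct AsMember {
  Header H;
  EmptyStorage S;
};

// As a base class, the empty object can share the address of the first
// member and contributes nothing to the size.
struct AsBase : EmptyStorage {
  Header H;
};

static_assert(sizeof(AsBase) == sizeof(Header), "empty base adds no size");
static_assert(sizeof(AsMember) > sizeof(AsBase), "empty member adds size");
```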
Jonas Devlieghere [Sat, 23 Jun 2018 16:51:10 +0000 (16:51 +0000)]
[llc] Use WithColor for printing errors/warnings
Use the WithColor helper from support to print errors and warnings.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335418
91177308-0d34-0410-b5e6-
96231b3b80d8
Jonas Devlieghere [Sat, 23 Jun 2018 16:50:09 +0000 (16:50 +0000)]
[llvm-config] Use WithColor for printing errors.
Use the WithColor helper from support to print errors.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335417
91177308-0d34-0410-b5e6-
96231b3b80d8
Jonas Devlieghere [Sat, 23 Jun 2018 16:49:07 +0000 (16:49 +0000)]
[llvm-mt] Use WithColor for printing errors.
Use the WithColor helper from support to print errors.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335416
91177308-0d34-0410-b5e6-
96231b3b80d8
Jonas Devlieghere [Sat, 23 Jun 2018 16:48:03 +0000 (16:48 +0000)]
[TableGen] Use WithColor for printing errors/warnings
Use the WithColor helper from support to print errors and warnings.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335415
91177308-0d34-0410-b5e6-
96231b3b80d8
Craig Topper [Sat, 23 Jun 2018 06:15:04 +0000 (06:15 +0000)]
[X86] Make %eiz usage in 64-bit mode force a 0x67 address size prefix. Fix some test CHECK lines.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335414
91177308-0d34-0410-b5e6-
96231b3b80d8
Craig Topper [Sat, 23 Jun 2018 06:03:48 +0000 (06:03 +0000)]
[X86] Teach disassembler to use %eip instead of %rip when 0x67 prefix is used on a rip-relative address.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335413
91177308-0d34-0410-b5e6-
96231b3b80d8
Craig Topper [Sat, 23 Jun 2018 05:53:00 +0000 (05:53 +0000)]
[X86][AsmParser] Improve base/index register checks.
-Ensure EIP isn't used with an index register.
-Ensure EIP isn't used as index register.
-Ensure base register isn't a vector register.
-Ensure eiz/riz usage matches the size of their base register.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335412
91177308-0d34-0410-b5e6-
96231b3b80d8
Stanislav Mekhanoshin [Sat, 23 Jun 2018 04:01:28 +0000 (04:01 +0000)]
Fix invariant fdiv hoisting in LICM
FDiv is replaced with multiplication by reciprocal and invariant
reciprocal is hoisted out of the loop, while multiplication remains
even if invariant.
Switch checks for all invariant operands and only invariant
denominator to fix the issue.
Differential Revision: https://reviews.llvm.org/D48447
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335411
91177308-0d34-0410-b5e6-
96231b3b80d8
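Note: the rewrite this commit refers to can be sketched at source level as below. This is an illustration only, with hypothetical function names; the real transform works on IR and, as far as I know, applies only when fast-math flags permit using a reciprocal. When both operands are invariant, the whole division can simply be hoisted, which is the case the commit says was mishandled.

```cpp
#include <vector>

// Before: one division per iteration, with a loop-invariant denominator d.
std::vector<double> divideEach(const std::vector<double> &xs, double d) {
  std::vector<double> out;
  out.reserve(xs.size());
  for (double x : xs)
    out.push_back(x / d);
  return out;
}

// After: the reciprocal is computed once outside the loop and the loop
// body multiplies instead of dividing.
std::vector<double> multiplyByReciprocal(const std::vector<double> &xs,
                                         double d) {
  const double r = 1.0 / d; // hoisted invariant reciprocal
  std::vector<double> out;
  out.reserve(xs.size());
  for (double x : xs)
    out.push_back(x * r);
  return out;
}
```

The two forms agree exactly when d is a power of two; for other denominators the results can differ in the last bit, which is why the rewrite is not valid under strict floating-point semantics.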
Reid Kleckner [Sat, 23 Jun 2018 03:05:39 +0000 (03:05 +0000)]
[AMDGPU] Update includes for intrinsic changes :(
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335409
91177308-0d34-0410-b5e6-
96231b3b80d8
Lang Hames [Sat, 23 Jun 2018 02:22:10 +0000 (02:22 +0000)]
[ORC] Fix formatting and list pending queries in VSO::dump.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335408
91177308-0d34-0410-b5e6-
96231b3b80d8
Reid Kleckner [Sat, 23 Jun 2018 02:02:38 +0000 (02:02 +0000)]
[IR] Split Intrinsics.inc into enums and implementations
Implements PR34259
Intrinsics.h is a very popular header. Most LLVM TUs care about things
like dbg_value, but they don't care how they are implemented. After I
split these out, IntrinsicImpl.inc is 1.7 MB, so this saves each LLVM TU
from scanning 1.7 MB of source that gets pre-processed away.
It also means we can modify intrinsic properties without triggering a
full rebuild, but that's probably less of a win.
I think the next best thing to do would be to split out the target
intrinsics into their own header. Very, very few TUs care about
target-specific intrinsics. It's very hard to split up the target
independent intrinsics like llvm.expect, assume, and dbg.value, though.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335407
91177308-0d34-0410-b5e6-
96231b3b80d8
Reid Kleckner [Sat, 23 Jun 2018 01:19:49 +0000 (01:19 +0000)]
Avoid including intrin.h from MathExtras.h
This is repeatably worth 0.3s compile time on MathExtras.cpp. This is a
very popular header, and it basically pulls all Intel intrinsics into
every LLVM TU. Let's not do that.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335406
91177308-0d34-0410-b5e6-
96231b3b80d8
Fangrui Song [Sat, 23 Jun 2018 00:15:33 +0000 (00:15 +0000)]
[ELF] Change isSectionData to exclude SHF_EXECINSTR
Summary:
This affects what sections are displayed as "DATA" in llvm-objdump.
The other user llvm-size is unaffected.
Before, a "TEXT" section is also "DATA", which seems weird.
The sh_flags condition matches that of bfd's SEC_DATA but the sh_type
condition uses (== SHT_PROGBITS) instead of bfd's (!= SHT_NOBITS).
bfd's SEC_DATA is not appealing as so many sections will be shown as DATA.
Reviewers: jyknight, Bigcheese
Subscribers: llvm-commits
Differential Revision: https://reviews.llvm.org/D48472
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335405
91177308-0d34-0410-b5e6-
96231b3b80d8
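Note: a sketch of the kind of predicate described above. The constant values are the standard ELF ones; the exact shape of the check (SHT_PROGBITS, SHF_ALLOC set, SHF_EXECINSTR clear) is an assumption drawn from the description, not a copy of the LLVM source, and the k-prefixed names are hypothetical.

```cpp
#include <cstdint>

// Standard ELF section types and flags (values from the ELF specification).
constexpr std::uint32_t kShtProgbits = 1;
constexpr std::uint32_t kShtNobits = 8;
constexpr std::uint64_t kShfWrite = 0x1;
constexpr std::uint64_t kShfAlloc = 0x2;
constexpr std::uint64_t kShfExecinstr = 0x4;

// Assumed check: an allocated PROGBITS section that is not executable.
constexpr bool isSectionData(std::uint32_t type, std::uint64_t flags) {
  return type == kShtProgbits && (flags & kShfAlloc) != 0 &&
         (flags & kShfExecinstr) == 0;
}
```

Under this sketch a typical .data section (PROGBITS, ALLOC|WRITE) is data, a typical .text section (PROGBITS, ALLOC|EXECINSTR) is not, and .bss (NOBITS) is not.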
Craig Topper [Sat, 23 Jun 2018 00:03:20 +0000 (00:03 +0000)]
[X86][AsmParser] Rework the handling that allows (%dx) to be used in place of %dx with in/out instructions.
Previously, to support (%dx) we left a wide open hole in our 16-bit memory address checking. This let this address value be used with any instruction without error in the parser. It would later fail in the encoder with an assertion failure on debug builds and who knows what on release builds.
This patch passes the mnemonic down to the memory operand parsing function so we can allow the (%dx) form only on specific instructions.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335403
91177308-0d34-0410-b5e6-
96231b3b80d8
Reid Kleckner [Fri, 22 Jun 2018 23:53:22 +0000 (23:53 +0000)]
[RuntimeDyld] Implement the ELF PIC large code model relocations
Prerequisite for https://reviews.llvm.org/D47211 which improves our ELF
large PIC codegen.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335402
91177308-0d34-0410-b5e6-
96231b3b80d8
Eli Friedman [Fri, 22 Jun 2018 22:58:55 +0000 (22:58 +0000)]
[LoopReroll] Rewrite induction variable rewriting.
This gets rid of a bunch of weird special cases; instead, just use SCEV
rewriting for everything. In addition to being simpler, this fixes a
bug where we would use the wrong stride in certain edge cases.
The one bit I'm not quite sure about is the trip count handling,
specifically the FIXME about overflow. In general, I think we need to
widen the exit condition, but that's probably not profitable if the new
type isn't legal, so we probably need a check somewhere. That said, I
don't think I'm making the existing problem any worse.
As a followup to this, a bunch of IV-related code in root-finding could
be cleaned up; with SCEV-based rewriting, there isn't any reason to
assume a loop will have exactly one or two PHI nodes.
Differential Revision: https://reviews.llvm.org/D45191
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335400
91177308-0d34-0410-b5e6-
96231b3b80d8
George Burgess IV [Fri, 22 Jun 2018 22:34:07 +0000 (22:34 +0000)]
[MSSA] Remove incorrect comment + `auto`ify dyn_cast results; NFC
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335399
91177308-0d34-0410-b5e6-
96231b3b80d8
Craig Topper [Fri, 22 Jun 2018 22:28:39 +0000 (22:28 +0000)]
[X86][AsmParser] Keep track of whether an explicit scale was specified while parsing an address in Intel syntax. Use it for improved error checking.
This allows us to check these:
-16-bit addressing doesn't support scale so we should error if we find one there.
-Multiplying ESP/RSP by a scale even if the scale is 1 should be an error because ESP/RSP can't be an index.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335398
91177308-0d34-0410-b5e6-
96231b3b80d8
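Note: the ESP/RSP rule above can be sketched as follows. This is a simplified model with hypothetical names (IntelMem, legalizeBaseIndex, accepts), not the parser's actual data structures: a stack pointer in the second slot is moved to the base slot only when no scale was written, and any stack pointer left in the index slot is an error.

```cpp
#include <string>
#include <utility>

// Hypothetical, simplified view of a parsed Intel-syntax memory operand.
struct IntelMem {
  std::string Base;
  std::string Index;
  bool ExplicitScale; // true if the source spelled out "*N", even "*1"
};

bool isStackPointer(const std::string &reg) {
  return reg == "ESP" || reg == "RSP";
}

// Returns true if the operand is acceptable; may swap base and index.
bool legalizeBaseIndex(IntelMem &m) {
  // Without an explicit scale, [EAX+ESP] can be read as base=ESP, index=EAX.
  if (isStackPointer(m.Index) && !m.ExplicitScale && !isStackPointer(m.Base))
    std::swap(m.Base, m.Index);
  // ESP/RSP can never be an index register.
  return !isStackPointer(m.Index);
}

// Convenience wrapper for trying out operand spellings.
bool accepts(const std::string &first, const std::string &second,
             bool explicitScale) {
  IntelMem m{first, second, explicitScale};
  return legalizeBaseIndex(m);
}
```

In this model [EAX+ESP] is accepted after the swap, while [EAX+ESP*1] is rejected because the written scale pins ESP to the index slot.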
Fangrui Song [Fri, 22 Jun 2018 22:20:10 +0000 (22:20 +0000)]
[llvm-size] Make global variables static
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335397
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Fri, 22 Jun 2018 22:07:26 +0000 (22:07 +0000)]
[x86] add more tests for bit hacking opportunities with setcc; NFC
Missed cases where the input and output are the same size in rL335391.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335396
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Fri, 22 Jun 2018 22:06:33 +0000 (22:06 +0000)]
[PowerPC] add more tests for bit hacking opportunities with setcc; NFC
Missed cases where the input and output are the same size in rL335390.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335395
91177308-0d34-0410-b5e6-
96231b3b80d8
Craig Topper [Fri, 22 Jun 2018 21:57:24 +0000 (21:57 +0000)]
[X86][AsmParser] In Intel syntax make sure we support ESP/RSP being the second register in memory expressions like [EAX+ESP].
By default, the second register gets assigned to the index register slot. But ESP can't be an index register so we need to swap it with the other register.
There's still a slight bug that we allow [EAX+ESP*1]. The existence of the multiply, even though it's with 1, should force ESP to the index register and trigger an error, but it doesn't currently.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335394
91177308-0d34-0410-b5e6-
96231b3b80d8
Matt Davis [Fri, 22 Jun 2018 21:35:26 +0000 (21:35 +0000)]
[llvm-mca] Remove unnecessary include and forward decl in RCU. NFC.
The DispatchUnit is no longer a dependency of RCU, so this patch removes a
stale include and forward decl. This patch also cleans up some comments.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335392
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Fri, 22 Jun 2018 21:16:54 +0000 (21:16 +0000)]
[x86] add tests for bit hacking opportunities with setcc; NFC
We likely gave up on folding some select-of-constants patterns in
IR with rL331486, and we need to recover those in the DAG.
The tests without select are based on our current DAGCombiner
optimizations for select-of-constants.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335391
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Fri, 22 Jun 2018 21:16:29 +0000 (21:16 +0000)]
[PowerPC] add tests for bit hacking opportunities with setcc; NFC
We likely gave up on folding some select-of-constants patterns in
IR with rL331486, and we need to recover those in the DAG.
The tests without select are based on our current DAGCombiner
optimizations for select-of-constants.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335390
91177308-0d34-0410-b5e6-
96231b3b80d8
Craig Topper [Fri, 22 Jun 2018 21:09:31 +0000 (21:09 +0000)]
[X86] Add test cases showing missed select simplification for MCU when icmp is in a slightly different form.
These test cases show that the "(select (and (x , 0x1) == 0), y, (z ^ y) ) -> (-(and (x , 0x1)) & z ) ^ y" doesn't work if the select condition is changed to (and (x, 0x1) != 1)
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335389
91177308-0d34-0410-b5e6-
96231b3b80d8
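Note: the identity quoted above can be checked with a few lines of integer arithmetic. The function names are hypothetical; selectFormNe1 is the "!= 1" spelling of the condition, which is logically the same test on a one-bit value.

```cpp
#include <cstdint>

// (select (and(x, 1) == 0), y, (z ^ y))
std::uint32_t selectForm(std::uint32_t x, std::uint32_t y, std::uint32_t z) {
  return ((x & 1u) == 0) ? y : (z ^ y);
}

// (-(and(x, 1)) & z) ^ y : the negated low bit is an all-zeros or
// all-ones mask, so z is either dropped or kept before the xor.
std::uint32_t maskForm(std::uint32_t x, std::uint32_t y, std::uint32_t z) {
  return ((0u - (x & 1u)) & z) ^ y;
}

// Same select, with the condition written as and(x, 1) != 1.
std::uint32_t selectFormNe1(std::uint32_t x, std::uint32_t y,
                            std::uint32_t z) {
  return ((x & 1u) != 1u) ? y : (z ^ y);
}
```

All three functions return the same value for any inputs, which is why the "!= 1" form is a missed opportunity rather than a different computation.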
Aditya Nandakumar [Fri, 22 Jun 2018 20:58:51 +0000 (20:58 +0000)]
[GISel]: Add G_ADDRSPACE_CAST Opcode
Added IRTranslator support for addrspacecast.
https://reviews.llvm.org/D48469
reviewed by: volkan
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335388
91177308-0d34-0410-b5e6-
96231b3b80d8
Fangrui Song [Fri, 22 Jun 2018 20:29:42 +0000 (20:29 +0000)]
[gdb] Use Latin-1 to decode StringRef
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335387
91177308-0d34-0410-b5e6-
96231b3b80d8
Tobias Edler von Koch [Fri, 22 Jun 2018 20:23:21 +0000 (20:23 +0000)]
Re-land "[LTO] Enable module summary emission by default for regular LTO"
Since we are now producing a summary also for regular LTO builds, we
need to run the NameAnonGlobals pass in those cases as well (the
summary cannot handle anonymous globals).
See https://reviews.llvm.org/D34156 for details on the original change.
This reverts commit 6c9ee4a4a438a8059aacc809b2dd57128fccd6b3.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335385
91177308-0d34-0410-b5e6-
96231b3b80d8
Craig Topper [Fri, 22 Jun 2018 20:20:38 +0000 (20:20 +0000)]
[X86] Don't accept (%si,%bp) 16-bit address expressions.
The second register is the index register and should only be %si or %di if used with a base register. And in that case the base register should be %bp or %bx.
This makes us compatible with gas.
We do still need to support both orders with Intel syntax which uses [bp+si] and [si+bp]
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335384
91177308-0d34-0410-b5e6-
96231b3b80d8
Craig Topper [Fri, 22 Jun 2018 19:42:21 +0000 (19:42 +0000)]
[X86][AsmParser] Allow (%bp,%si) and (%bp,%di) to be encoded without using a zero displacement.
(%bp) can't be encoded without a displacement. The encoding is instead used for displacement alone. So a 1 byte displacement of 0 must be used. But if there is an index register we can encode without a displacement.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335379
91177308-0d34-0410-b5e6-
96231b3b80d8
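Note: a sketch of the 16-bit ModRM rules behind this change. The r/m numbering follows the standard 16-bit addressing table; the enum and function names are hypothetical and the code is not taken from the X86 backend.

```cpp
#include <cstdint>

enum class Reg16 { None, BX, BP, SI, DI };

// r/m field for a 16-bit base/index pair, or -1 if the pair is not
// encodable in this order (e.g. base=SI, index=BP).
int rmField(Reg16 base, Reg16 index) {
  if (base == Reg16::BX && index == Reg16::SI) return 0;
  if (base == Reg16::BX && index == Reg16::DI) return 1;
  if (base == Reg16::BP && index == Reg16::SI) return 2;
  if (base == Reg16::BP && index == Reg16::DI) return 3;
  if (index != Reg16::None) return -1;
  if (base == Reg16::SI) return 4;
  if (base == Reg16::DI) return 5;
  if (base == Reg16::BP) return 6; // with mod=0 this slot means disp16 only
  if (base == Reg16::BX) return 7;
  return -1;
}

// mod field: 0 = no displacement, 1 = 8-bit, 2 = 16-bit displacement.
int modField(Reg16 base, Reg16 index, int disp) {
  // Bare (%bp) shares its mod=0 encoding with "displacement only", so it
  // must carry an explicit zero displacement.
  const bool bareBP = base == Reg16::BP && index == Reg16::None;
  if (disp == 0 && !bareBP) return 0;
  if (disp >= -128 && disp <= 127) return 1;
  return 2;
}
```

With this model (%bp) needs mod=1 and a zero byte, while (%bp,%si) and (%bp,%di) can use mod=0 with no displacement.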
Konstantin Zhuravlyov [Fri, 22 Jun 2018 19:23:18 +0000 (19:23 +0000)]
AMDHSA: Put old assembler docs back
Until we switch to code object v3 by default.
Follow up for https://reviews.llvm.org/D47736.
Differential Revision: https://reviews.llvm.org/D48497
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335378
91177308-0d34-0410-b5e6-
96231b3b80d8
Simon Pilgrim [Fri, 22 Jun 2018 18:31:57 +0000 (18:31 +0000)]
[X86][SSE] Add sdiv by (nonuniform) minus one tests (PR37119)
Test cases from D45806
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335376
91177308-0d34-0410-b5e6-
96231b3b80d8
Craig Topper [Fri, 22 Jun 2018 17:50:40 +0000 (17:50 +0000)]
[X86][AsmParser] Check for invalid 16-bit base register in Intel syntax.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335373
91177308-0d34-0410-b5e6-
96231b3b80d8
Craig Topper [Fri, 22 Jun 2018 17:15:58 +0000 (17:15 +0000)]
[X86] Don't allow ESP/RSP to be used as an index register in assembly.
Fixes PR37892
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335370
91177308-0d34-0410-b5e6-
96231b3b80d8
Alina Sbirlea [Fri, 22 Jun 2018 17:14:35 +0000 (17:14 +0000)]
[LoopUnswitch]Fix comparison for DomTree updates.
Summary:
In LoopUnswitch when replacing a branch Parent -> Succ with a conditional
branch Parent -> True & Parent->False, the DomTree updates should insert an edge for
each of True/False if True/False are different than Succ, and delete Parent->Succ edge
if both are different. The comparison with Succ appears to be incorrect,
it's comparing with Parent instead.
There is no test failing either before or after this change, but it seems to me this is
the right way to do the update.
Reviewers: chandlerc, kuhar
Subscribers: sanjoy, jlebar, llvm-commits
Differential Revision: https://reviews.llvm.org/D48457
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335369
91177308-0d34-0410-b5e6-
96231b3b80d8
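Note: the update rule in the summary above can be sketched as follows. Blocks are modelled as plain strings and Update/branchUpdates are hypothetical names, not the DominatorTree API; the sketch follows the wording of the summary literally and does not deduplicate the case where True and False are the same block.

```cpp
#include <string>
#include <vector>

enum class Kind { Insert, Delete };

// One pending dominator-tree edge update.
struct Update {
  Kind K;
  std::string From;
  std::string To;
};

// Updates needed when the branch Parent -> Succ is replaced by a
// conditional branch Parent -> True, Parent -> False.
std::vector<Update> branchUpdates(const std::string &parent,
                                  const std::string &succ,
                                  const std::string &trueDest,
                                  const std::string &falseDest) {
  std::vector<Update> updates;
  // Compare each new destination with Succ (not with Parent).
  if (trueDest != succ)
    updates.push_back(Update{Kind::Insert, parent, trueDest});
  if (falseDest != succ)
    updates.push_back(Update{Kind::Insert, parent, falseDest});
  // The old edge disappears only if neither destination is Succ.
  if (trueDest != succ && falseDest != succ)
    updates.push_back(Update{Kind::Delete, parent, succ});
  return updates;
}
```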
Andrea Di Biagio [Fri, 22 Jun 2018 17:03:40 +0000 (17:03 +0000)]
[llvm-mca] Remove redundant call. NFC
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335368
91177308-0d34-0410-b5e6-
96231b3b80d8
Easwaran Raman [Fri, 22 Jun 2018 17:01:13 +0000 (17:01 +0000)]
[X86] Add a test to show missed opportunity to generate vfnmadd
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335367
91177308-0d34-0410-b5e6-
96231b3b80d8
Krzysztof Parzyszek [Fri, 22 Jun 2018 16:38:38 +0000 (16:38 +0000)]
Initialize LiveRegs once in BranchFolder::mergeCommonTails
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335365
91177308-0d34-0410-b5e6-
96231b3b80d8
Simon Pilgrim [Fri, 22 Jun 2018 16:37:34 +0000 (16:37 +0000)]
[SLPVectorizer] Support alternate opcodes in tryToVectorizeList
Enable tryToVectorizeList to support InstructionsState alternate opcode patterns at a root (build vector etc.) as well as further down the vectorization tree.
NOTE: This patch reduces some of the debug reporting if there are opcode mismatches - I can try to add it back if it proves a problem. But it could get rather messy trying to provide equivalent verbose debug strings via getSameOpcode etc.
Differential Revision: https://reviews.llvm.org/D48488
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335364
91177308-0d34-0410-b5e6-
96231b3b80d8
Andrea Di Biagio [Fri, 22 Jun 2018 16:37:05 +0000 (16:37 +0000)]
[llvm-mca] Set the operand ID for implicit register reads/writes. NFC
Also, move the definition of InstRef at the end of Instruction.h to avoid a
forward declaration.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335363
91177308-0d34-0410-b5e6-
96231b3b80d8
Matt Davis [Fri, 22 Jun 2018 16:17:26 +0000 (16:17 +0000)]
[llvm-mca] Introduce a sequential container of Stages
Summary:
Remove explicit stages and introduce a list of stages.
A pipeline should be composed of an arbitrary list of stages, and not any
predefined list of stages in the Backend. The Backend should not know of any
particular stage, rather it should only be concerned that it has a list of
stages, and that those stages will fulfill the contract of what it means to be
a Stage (namely pre/post/execute a given instruction).
For now, we leave the original set of stages defined in the Backend ctor;
however, I imagine these will be moved out at a later time.
This patch makes an adjustment to the semantics of Stage::isReady.
Specifically, what the Backend really needs to know is if a Stage has
unfinished work. With that said, it is more appropriately renamed
Stage::hasWorkToComplete(). This change will clean up the check in
Backend::run(), allowing us to query each stage to see if there is unfinished
work, regardless of what subclass a stage might be. I feel that this change
simplifies the semantics too, but that's a subjective statement.
Given how RetireStage and ExecuteStage handle data in their preExecute(), I've
had to change the order of Retire and Execute in our stage list. Retire must
complete any of its preExecute actions before ExecuteStage's preExecute can
take control. This is mainly because both stages utilize the RCU. In the
meantime, I want to see if I can adjust that or remove that coupling.
Reviewers: andreadb, RKSimon, courbet
Reviewed By: andreadb
Subscribers: tschuett, gbedwell, llvm-commits
Differential Revision: https://reviews.llvm.org/D46907
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335361
91177308-0d34-0410-b5e6-
96231b3b80d8
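Note: the design described in the summary above can be sketched roughly as below. Only the names Stage and hasWorkToComplete come from the commit message; the method signatures are simplified (the real ones take an instruction reference), and CountdownStage, Pipeline and cyclesToDrain are hypothetical.

```cpp
#include <memory>
#include <utility>
#include <vector>

// Contract every stage fulfils: pre/execute/post, plus a query for
// unfinished work.
class Stage {
public:
  virtual ~Stage() = default;
  virtual bool hasWorkToComplete() const = 0;
  virtual void preExecute() {}
  virtual void execute() = 0;
  virtual void postExecute() {}
};

// Hypothetical stage that holds a fixed number of work units.
class CountdownStage : public Stage {
  int Remaining;

public:
  explicit CountdownStage(int units) : Remaining(units) {}
  bool hasWorkToComplete() const override { return Remaining > 0; }
  void execute() override {
    if (Remaining > 0)
      --Remaining;
  }
};

// The driver knows only that it has a list of stages.
class Pipeline {
  std::vector<std::unique_ptr<Stage>> Stages;

public:
  void append(std::unique_ptr<Stage> s) { Stages.push_back(std::move(s)); }

  bool hasWorkToProcess() const {
    for (const auto &s : Stages)
      if (s->hasWorkToComplete())
        return true;
    return false;
  }

  // Runs cycles until no stage has unfinished work; returns the cycle count.
  int run() {
    int cycles = 0;
    while (hasWorkToProcess()) {
      for (auto &s : Stages) s->preExecute();
      for (auto &s : Stages) s->execute();
      for (auto &s : Stages) s->postExecute();
      ++cycles;
    }
    return cycles;
  }
};

// Builds a two-stage pipeline and reports how many cycles it takes to drain.
int cyclesToDrain(int unitsA, int unitsB) {
  Pipeline p;
  p.append(std::unique_ptr<Stage>(new CountdownStage(unitsA)));
  p.append(std::unique_ptr<Stage>(new CountdownStage(unitsB)));
  return p.run();
}
```

Because the loop only asks each stage whether it has work left, new stage subclasses can be appended without touching the driver.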
Simon Pilgrim [Fri, 22 Jun 2018 16:10:26 +0000 (16:10 +0000)]
[SLPVectorizer] reorderAltShuffleOperands should just take InstructionsState. NFCI.
All calls were extracting the InstructionsState Opcode/AltOpcode values so we might as well pass it directly
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335359
91177308-0d34-0410-b5e6-
96231b3b80d8
Paul Robinson [Fri, 22 Jun 2018 15:19:45 +0000 (15:19 +0000)]
Fix test again, try to keep all targets happy
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335356
91177308-0d34-0410-b5e6-
96231b3b80d8
Paul Robinson [Fri, 22 Jun 2018 15:07:26 +0000 (15:07 +0000)]
Fix test, nop is not always 1 byte
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335353
91177308-0d34-0410-b5e6-
96231b3b80d8
Paul Robinson [Fri, 22 Jun 2018 14:16:11 +0000 (14:16 +0000)]
[DWARFv5] Allow ".loc 0" to refer to the root file.
DWARF v5 explicitly represents file #0 in the line table. Prior
versions did not, so ".loc 0" is still an error in those cases.
Differential Revision: https://reviews.llvm.org/D48452
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335350
91177308-0d34-0410-b5e6-
96231b3b80d8
Simon Pilgrim [Fri, 22 Jun 2018 14:04:06 +0000 (14:04 +0000)]
[SLPVectorizer] Relax alternate opcodes to accept any BinaryOperator pair
SLP currently only accepts (F)Add/(F)Sub alternate counterpart ops to be merged into an alternate shuffle.
This patch relaxes this to accept any pair of BinaryOperator opcodes instead, assuming the target's cost model accepts the vectorization+shuffle.
Differential Revision: https://reviews.llvm.org/D48477
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335349
91177308-0d34-0410-b5e6-
96231b3b80d8
Simon Pilgrim [Fri, 22 Jun 2018 13:53:58 +0000 (13:53 +0000)]
[SLPVectorizer][X86] Add alternate opcode tests for simple build vector cases
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335348
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Fri, 22 Jun 2018 13:44:42 +0000 (13:44 +0000)]
[InstCombine] add shuffle+binops test from PR37806; NFC
This one shows another pattern that we'll need to match
in some cases, but the current ordering of folds allows
us to match this as 2 binops before simplification takes
place.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335347
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Fri, 22 Jun 2018 13:19:25 +0000 (13:19 +0000)]
[InstCombine] add tests for shuffle-with-different-binops; NFC
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335345
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Fri, 22 Jun 2018 12:46:16 +0000 (12:46 +0000)]
[InstCombine] rearrange shuffle-of-binops logic; NFC
The commutative matcher makes things more complicated
here, and I'm planning an enhancement where this
form is more readable.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335343
91177308-0d34-0410-b5e6-
96231b3b80d8
Simon Pilgrim [Fri, 22 Jun 2018 12:41:48 +0000 (12:41 +0000)]
[X86] Regenerate tests to include fma comments
Noticed in the review of D48467
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335342
91177308-0d34-0410-b5e6-
96231b3b80d8
Gabor Buella [Fri, 22 Jun 2018 12:01:43 +0000 (12:01 +0000)]
[X86] Add notes to a few intrinsics
This is a change corresponding to the clang change in
https://reviews.llvm.org/D45616
Reviewers: craig.topper, uriel.k, RKSimon, andrew.w.kaylor, spatel, scanon, efriedma
Reviewed By: craig.topper
Differential Revision: https://reviews.llvm.org/D48280
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335340
91177308-0d34-0410-b5e6-
96231b3b80d8
George Rimar [Fri, 22 Jun 2018 10:53:47 +0000 (10:53 +0000)]
Recommit r335333 "[MC] - Add .stack_size sections into groups and link them with .text"
With compilation fix.
Original commit message:
D39788 added a '.stack-size' section containing metadata on function stack sizes
to output ELF files behind the new -stack-size-section flag.
This change does the following two things on top:
1) Imagine the case when the -ffunction-sections flag is given and there are text sections in COMDATs.
The patch adds a '.stack-size' section into the corresponding COMDAT group, so that the linker will be able to
eliminate them quickly while resolving the COMDATs.
2) The patch sets the SHF_LINK_ORDER flag and links '.stack-size' with the corresponding .text.
With that, the linker will be able to do -gc-sections on dead stack size sections.
Differential revision: https://reviews.llvm.org/D46874
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335336
91177308-0d34-0410-b5e6-
96231b3b80d8
Simon Pilgrim [Fri, 22 Jun 2018 10:48:02 +0000 (10:48 +0000)]
[IR] Use Instruction::isBinaryOp helper instead of raw enum range tests. NFCI.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335335
91177308-0d34-0410-b5e6-
96231b3b80d8
George Rimar [Fri, 22 Jun 2018 10:27:33 +0000 (10:27 +0000)]
Revert r335332 "[MC] - Add .stack_size sections into groups and link them with .text"
It broke bots.
http://lab.llvm.org:8011/builders/clang-ppc64le-linux-lnt/builds/12891
http://lab.llvm.org:8011/builders/clang-cmake-x86_64-sde-avx512-linux/builds/9443
http://lab.llvm.org:8011/builders/lldb-x86_64-ubuntu-14.04-buildserver/builds/25551
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335333
91177308-0d34-0410-b5e6-
96231b3b80d8
George Rimar [Fri, 22 Jun 2018 10:10:53 +0000 (10:10 +0000)]
[MC] - Add .stack_size sections into groups and link them with .text
D39788 added a '.stack-size' section containing metadata on function stack sizes
to output ELF files behind the new -stack-size-section flag.
This change does the following two things on top:
1) Imagine the case when the -ffunction-sections flag is given and there are text sections in COMDATs.
The patch adds a '.stack-size' section into the corresponding COMDAT group, so that the linker will be able to
eliminate them quickly while resolving the COMDATs.
2) The patch sets the SHF_LINK_ORDER flag and links '.stack-size' with the corresponding .text.
With that, the linker will be able to do -gc-sections on dead stack size sections.
Differential revision: https://reviews.llvm.org/D46874
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335332
91177308-0d34-0410-b5e6-
96231b3b80d8
Sjoerd Meijer [Fri, 22 Jun 2018 10:03:03 +0000 (10:03 +0000)]
Recommit of r335326, with the test fixed that I missed.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335331
91177308-0d34-0410-b5e6-
96231b3b80d8
Simon Pilgrim [Fri, 22 Jun 2018 09:45:31 +0000 (09:45 +0000)]
[CostModel][AArch64] Add some initial costs for SK_Select and SK_PermuteSingleSrc
AArch64 was only setting costs for SK_Transpose, which meant that many of the simpler shuffles (e.g. SK_Select and SK_PermuteSingleSrc for larger vector elements) were being severely overestimated by the default shuffle expansion.
This patch adds costs to help improve SLP performance and avoid a regression in reductions introduced by D48174.
I'm not very knowledgeable about AArch64 shuffle lowering so I've kept the extra costs to a minimum - someone who knows this code can add extra costs which should improve vectorization a lot more.
Differential Revision: https://reviews.llvm.org/D48172
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335329
91177308-0d34-0410-b5e6-
96231b3b80d8
Sjoerd Meijer [Fri, 22 Jun 2018 09:17:08 +0000 (09:17 +0000)]
Reverting r335326 while I look at the test failure
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335328
91177308-0d34-0410-b5e6-
96231b3b80d8
Eugene Leviant [Fri, 22 Jun 2018 08:57:01 +0000 (08:57 +0000)]
Revert r335324 due to a buildbot failure
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335327
91177308-0d34-0410-b5e6-
96231b3b80d8
Sjoerd Meijer [Fri, 22 Jun 2018 08:48:13 +0000 (08:48 +0000)]
[ARM] ARMv6m and v8m.baseline strict align
This sets target feature FeatureStrictAlign for Armv6-m and Armv8-m.baseline,
because these architectures have no support for unaligned accesses.
It looks like we always pass target feature "+strict-align" from
Clang, so this is not a user facing problem, but querying the subtarget
(in e.g. llc) for unaligned access support is incorrect.
Differential Revision: https://reviews.llvm.org/D48437
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335326
91177308-0d34-0410-b5e6-
96231b3b80d8
Matt Arsenault [Fri, 22 Jun 2018 08:39:52 +0000 (08:39 +0000)]
AMDGPU: Add patterns for i32/i64 local atomic load/store
Not sure why the 32/64 split is needed in the atomic_load
store hierarchies. The regular PatFrags do this, but we don't
do it for the existing handling for global.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335325
91177308-0d34-0410-b5e6-
96231b3b80d8
Eugene Leviant [Fri, 22 Jun 2018 08:29:36 +0000 (08:29 +0000)]
[Evaluator] Improve evaluation of call instruction
Differential revision: https://reviews.llvm.org/D46584
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335324
91177308-0d34-0410-b5e6-
96231b3b80d8
Mikhail Dvoretckii [Fri, 22 Jun 2018 08:28:05 +0000 (08:28 +0000)]
[X86] Changing the check for valid inputs in combineScalarToVector
Changing the logic of scalar mask folding to check for valid input types rather
than against invalid ones, making it more robust and fixing PR37879.
Differential Revision: https://reviews.llvm.org/D48366
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335323
91177308-0d34-0410-b5e6-
96231b3b80d8
Chandler Carruth [Fri, 22 Jun 2018 05:33:57 +0000 (05:33 +0000)]
Revert r335306 (and r335314) - the Call Graph Profile pass.
This is the first pass in the main pipeline to use the legacy PM's
ability to run function analyses "on demand". Unfortunately, it turns
out there are bugs in that somewhat-hacky approach. At the very least,
it leaks memory and doesn't support -debug-pass=Structure. Unclear if
there are larger issues or not, but this should get the sanitizer bots
back to green by fixing the memory leaks.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335320
91177308-0d34-0410-b5e6-
96231b3b80d8
Tom Stellard [Fri, 22 Jun 2018 03:04:35 +0000 (03:04 +0000)]
AMDGPU/GlobalISel: Default to using TableGen'd instruction selector
Summary:
We can select all instructions that are marked as legal in a full piglit run,
so now is a good time to make the TableGen'd instruction selector default
for all opcodes. This is NFC for a full piglit run, which is why there are
no tests.
Reviewers: arsenm, nhaehnle
Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits
Differential Revision: https://reviews.llvm.org/D48198
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335319
91177308-0d34-0410-b5e6-
96231b3b80d8
Tom Stellard [Fri, 22 Jun 2018 02:54:57 +0000 (02:54 +0000)]
AMDGPU/GlobalISel: legalize and select 32-bit G_ASHR
Reviewers: arsenm, nhaehnle
Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, llvm-commits, t-tye
Differential Revision: https://reviews.llvm.org/D48196
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335318
91177308-0d34-0410-b5e6-
96231b3b80d8
Chandler Carruth [Fri, 22 Jun 2018 02:43:41 +0000 (02:43 +0000)]
[LegacyPM] Fix PR37888 by teaching the legacy loop pass manager how to
clear out deleted loops from the current queue beyond just the current
loop.
This is important because SimpleLoopUnswitch will now enqueue the same
loop to be re-processed. When it does this with the legacy PM, we don't
have a way of canceling the rest of the pipeline and so we can end up
deleting the loop before we reprocess it. =/
This change also makes it easy to support deleting other loops in the
queue to process, although I don't have any use cases for that.
Differential Revision: https://reviews.llvm.org/D48470
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335317
91177308-0d34-0410-b5e6-
96231b3b80d8
Tom Stellard [Fri, 22 Jun 2018 02:34:29 +0000 (02:34 +0000)]
AMDGPU/GlobalISel: legalize and select 32-bit G_SITOFP
Reviewers: arsenm, nhaehnle
Reviewed By: arsenm
Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits
Differential Revision: https://reviews.llvm.org/D48195
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335316
91177308-0d34-0410-b5e6-
96231b3b80d8
Tom Stellard [Fri, 22 Jun 2018 00:44:29 +0000 (00:44 +0000)]
AMDGPU/GlobalISel: Implement select() for COPY
Reviewers: arsenm, nhaehnle
Reviewed By: nhaehnle
Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits
Differential Revision: https://reviews.llvm.org/D46151
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335315
91177308-0d34-0410-b5e6-
96231b3b80d8
Chandler Carruth [Fri, 22 Jun 2018 00:32:26 +0000 (00:32 +0000)]
Fix test failures after r335306 due to the pipeline changing.
This wasn't obvious for the author to fix because this is the first
pipeline use of the magic utility to get function analyses within
a module pass in the legacy pass manager. Turns out that has a bug which
prevents dumping the structure of the pipeline and shows up as an
unnamed pass.
I've just left a FIXME for that as it doesn't seem worth fixing
and certainly shouldn't hold up getting the bots green.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335314
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Thu, 21 Jun 2018 23:56:59 +0000 (23:56 +0000)]
[InstCombine] fix shuffle-of-binops bug
With non-commutative binops, we could be using the same
variable value as operand 0 in 1 binop and operand 1 in
the other, so we have to check for that possibility and
bail out.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335312
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Thu, 21 Jun 2018 23:53:01 +0000 (23:53 +0000)]
[InstCombine] add test for shuffle-of-binops; NFC
This shows a miscompile that was missed in rL335283.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335311
91177308-0d34-0410-b5e6-
96231b3b80d8
Tom Stellard [Thu, 21 Jun 2018 23:38:20 +0000 (23:38 +0000)]
AMDGPU/GlobalISel: Implement select() for G_IMPLICIT_DEF
Reviewers: arsenm, nhaehnle
Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits
Differential Revision: https://reviews.llvm.org/D46150
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335307
91177308-0d34-0410-b5e6-
96231b3b80d8
Michael J. Spencer [Thu, 21 Jun 2018 23:31:10 +0000 (23:31 +0000)]
[Instrumentation] Add Call Graph Profile pass
This patch adds support for generating a call graph profile from Branch Frequency Info.
The CGProfile module pass simply gets the block profile count for each BB and scans for call instructions. For each call instruction it adds an edge from the current function to the called function with the current BB block profile count as the weight.
After scanning all the functions, it generates an appending module flag containing the data. The format looks like:
!llvm.module.flags = !{!0}
!0 = !{i32 5, !"CG Profile", !1}
!1 = !{!2, !3, !4} ; List of edges
!2 = !{void ()* @a, void ()* @b, i64 32} ; Edge from a to b with a weight of 32
!3 = !{void (i1)* @freq, void ()* @a, i64 11}
!4 = !{void (i1)* @freq, void ()* @b, i64 20}
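The edge collection described above can be modeled roughly as below. This is a sketch under stated assumptions, not the pass itself: `ProfiledBlock`, `Edge`, and `buildCallGraphProfile` are hypothetical names, functions are identified by plain strings, and summing the weights of repeated caller/callee pairs is an assumption of this sketch.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical, simplified basic block: a profile count plus the names of
// the functions it calls.
struct ProfiledBlock {
  std::uint64_t Count;
  std::vector<std::string> Callees;
};

// (caller, callee)
using Edge = std::pair<std::string, std::string>;

// Scans every block of every function and adds the block's profile count to
// the weight of each call edge found in that block.
std::map<Edge, std::uint64_t> buildCallGraphProfile(
    const std::map<std::string, std::vector<ProfiledBlock>> &Module) {
  std::map<Edge, std::uint64_t> Weights;
  for (const auto &F : Module) {
    assert(!F.first.empty() && "functions need a name in this sketch");
    for (const ProfiledBlock &BB : F.second)
      for (const std::string &Callee : BB.Callees)
        Weights[Edge(F.first, Callee)] += BB.Count;
  }
  return Weights;
}
```

Feeding it a module where `a` calls `b` from a block with count 32, and `freq` calls `a` and `b` from blocks with counts 11 and 20, yields the three edges shown in the metadata example.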
Differential Revision: https://reviews.llvm.org/D48105
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335306
91177308-0d34-0410-b5e6-
96231b3b80d8
Reid Kleckner [Thu, 21 Jun 2018 23:06:33 +0000 (23:06 +0000)]
[X86] Fix 32-bit mingw comdat names, only add one underscore
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335304
91177308-0d34-0410-b5e6-
96231b3b80d8
Fangrui Song [Thu, 21 Jun 2018 22:34:29 +0000 (22:34 +0000)]
[gdb] Update llvm::Optional
Reviewers: dblaikie
Subscribers: llvm-commits
Differential Revision: https://reviews.llvm.org/D48461
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335303
91177308-0d34-0410-b5e6-
96231b3b80d8
Scott Linder [Thu, 21 Jun 2018 22:30:09 +0000 (22:30 +0000)]
[AMDGPU] Fix lit failures introduced in r335281
The tests do not support big-endian hosts.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335302
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Thu, 21 Jun 2018 22:25:42 +0000 (22:25 +0000)]
[IR] fix typo in comment; NFC
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335301
91177308-0d34-0410-b5e6-
96231b3b80d8
Reid Kleckner [Thu, 21 Jun 2018 22:19:05 +0000 (22:19 +0000)]
Revert r335297 "[X86] Implement more of x86-64 large and medium PIC code models"
MCJIT can't handle R_X86_64_GOT64 yet.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335300
91177308-0d34-0410-b5e6-
96231b3b80d8
Reid Kleckner [Thu, 21 Jun 2018 21:57:44 +0000 (21:57 +0000)]
[X86] Commit some comments that weren't in the medium code model patch
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335298
91177308-0d34-0410-b5e6-
96231b3b80d8
Reid Kleckner [Thu, 21 Jun 2018 21:55:08 +0000 (21:55 +0000)]
[X86] Implement more of x86-64 large and medium PIC code models
Summary:
The large code model allows code and data segments to exceed 2GB, which
means that some symbol references may require a displacement that cannot
be encoded as a displacement from RIP. The large PIC model even relaxes
the assumption that the GOT itself is within 2GB of all code. Therefore,
we need a special code sequence to materialize it:
.LtmpN:
leaq .LtmpN(%rip), %rbx
movabsq $_GLOBAL_OFFSET_TABLE_-.LtmpN, %rax # Scratch
addq %rax, %rbx # GOT base reg
From that, non-local references go through the GOT base register instead
of being PC-relative loads. Local references typically use GOTOFF
symbols, like this:
movq extern_gv@GOT(%rbx), %rax
movq local_gv@GOTOFF(%rbx), %rax
All calls end up being indirect:
movabsq $local_fn@GOTOFF, %rax
addq %rbx, %rax
callq *%rax
The medium code model retains the assumption that the code segment is
less than 2GB, so calls are once again direct, and the RIP-relative
loads can be used to access the GOT. Materializing the GOT is easy:
leaq _GLOBAL_OFFSET_TABLE_(%rip), %rbx # GOT base reg
DSO local data accesses will use it:
movq local_gv@GOTOFF(%rbx), %rax
Non-local data accesses will use RIP-relative addressing, which means we
may not always need to materialize the GOT base:
movq extern_gv@GOTPCREL(%rip), %rax
Direct calls are basically the same as they are in the small code model:
They use direct, PC-relative addressing, and the PLT is used for calls
to non-local functions.
This patch adds reasonably comprehensive testing of LEA, but there are
lots of interesting folding opportunities that are unimplemented.
Reviewers: chandlerc, echristo
Subscribers: hiraditya, llvm-commits
Differential Revision: https://reviews.llvm.org/D47211
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335297
91177308-0d34-0410-b5e6-
96231b3b80d8
Matthew Voss [Thu, 21 Jun 2018 21:43:20 +0000 (21:43 +0000)]
[GVN] Avoid casting a vector of size less than 8 bits to i8
Summary:
A reprise of D25849.
This crash was found through fuzzing some time ago and was documented in PR28879.
No check for load size has been added due to the following tests:
- Transforms/GVN/invariant.group.ll
- Transforms/GVN/pr10820.ll
These tests expect load sizes that are not a multiple of eight.
Thanks to @davide for the original patch.
Reviewers: nlopes, davide, RKSimon, reames, efriedma
Reviewed By: efriedma
Subscribers: davide, llvm-commits, Prazek
Differential Revision: https://reviews.llvm.org/D48330
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335294
91177308-0d34-0410-b5e6-
96231b3b80d8
Jonas Devlieghere [Thu, 21 Jun 2018 21:37:53 +0000 (21:37 +0000)]
[dsymutil] Force mmap'ing of binaries
After the recent refactoring that introduced parallel handling of
different objects, the binary holder became unique per object file. This
defeats its optimization of caching archives, leading to an archive
being opened for every binary it contains. This is obviously unfortunate
and will need to be refactored soon.
Luckily in practice, the impact of this is limited as most files are
mmap'ed instead of memcopy'd. There's a caveat however: when the memory
buffer requires a null terminator and it's a multiple of the page size,
we allocate instead of mmap'ing. If this happens for a static archive,
we end up with N copies of it in memory, where N is the number of
objects in the archive, leading to excessive memory usage. This patch
provides a stopgap solution that ensures all the files it loads are
mmap'ed by removing the requirement for a terminating null byte.
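The caveat can be illustrated with a small decision function. This is a simplified sketch: `canMmap` is a hypothetical name, and it assumes the null-terminator case is the only reason to fall back to allocating, which ignores any other conditions the real memory buffer code may check.

```cpp
#include <cassert>
#include <cstddef>

// Returns true if a file of FileSize bytes can be mapped directly, false if
// it has to be copied into an allocated buffer instead.
bool canMmap(std::size_t FileSize, std::size_t PageSize,
             bool RequiresNullTerminator) {
  assert(PageSize != 0 && "page size must be non-zero");
  // A mapping is zero-padded up to the next page boundary, so the byte just
  // past the end of the file acts as a terminator unless the file ends
  // exactly on a page boundary.
  bool EndsOnPageBoundary = FileSize % PageSize == 0;
  return !(RequiresNullTerminator && EndsOnPageBoundary);
}
```

With the requirement dropped, `canMmap(Size, PageSize, false)` is true for every size, which is the behavior this change relies on.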
Differential revision: https://reviews.llvm.org/D48397
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335293
91177308-0d34-0410-b5e6-
96231b3b80d8
Tim Shen [Thu, 21 Jun 2018 21:29:54 +0000 (21:29 +0000)]
[SCEV] Re-apply r335197 (with Polly fixes).
Summary:
This initiates a discussion on changing Polly accordingly while re-applying r335197 (D48338).
I have never worked on Polly. The proposed change to param_div_div_div_2.ll is not an informed one; it just updates the patterns to match the new output.
All LLVM files are already reviewed in D48338.
Reviewers: jdoerfert, bollu, efriedma
Subscribers: jlebar, sanjoy, hiraditya, llvm-commits, bixia
Differential Revision: https://reviews.llvm.org/D48453
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335292
91177308-0d34-0410-b5e6-
96231b3b80d8
Konstantin Zhuravlyov [Thu, 21 Jun 2018 20:28:19 +0000 (20:28 +0000)]
AMDGPU: Remove ability to reserve VGPRs for debugger
Differential Revision: https://reviews.llvm.org/D48234
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335288
91177308-0d34-0410-b5e6-
96231b3b80d8
Reid Kleckner [Thu, 21 Jun 2018 20:27:38 +0000 (20:27 +0000)]
[mingw] Fix GCC ABI compatibility for comdat things
Summary:
GCC and the binutils COFF linker do comdats differently from MSVC.
If we want to be ABI compatible, we have to do what they do, which is to
emit unique section names like ".text$_Z3foov" instead of short section
names like ".text". Otherwise, the binutils linker gets confused and
reports multiple definition errors when two object files from GCC and
Clang containing the same inline function are linked together.
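The naming difference can be sketched as below. `comdatSectionName` and its `GnuStyle` flag are hypothetical and exist only to illustrate the two conventions; they are not the names used in the patch.

```cpp
#include <cassert>
#include <string>

// Builds the section name for a comdat symbol under either convention.
std::string comdatSectionName(const std::string &Base,
                              const std::string &Symbol, bool GnuStyle) {
  assert(!Base.empty() && "section base name must be non-empty");
  if (!GnuStyle)
    return Base;               // MSVC style: short name such as ".text"
  return Base + "$" + Symbol;  // GCC/binutils style: ".text$_Z3foov"
}
```

For the inline function `foo()` mangled as `_Z3foov`, the GNU-style call produces `.text$_Z3foov`, the unique name the binutils linker expects.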
The best description of the issue is probably at
https://github.com/Alexpux/MINGW-packages/issues/1677; we don't seem to
have a good one in our tracker.
I fixed up the .pdata and .xdata sections needed everywhere other than
32-bit x86. GCC doesn't use associative comdats for those; it appears to
rely on the section name.
Reviewers: smeenai, compnerd, mstorsjo, martell, mati865
Subscribers: llvm-commits, hiraditya
Differential Revision: https://reviews.llvm.org/D48402
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335286
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Thu, 21 Jun 2018 20:15:09 +0000 (20:15 +0000)]
[InstCombine] fold vector select of binops with constant ops to 1 binop (PR37806)
This is the simplest case from PR37806:
https://bugs.llvm.org/show_bug.cgi?id=37806
If we have a common variable operand used in a pair of binops with vector constants
that are vector selected together, then we can constant shuffle the constant vectors
to eliminate the shuffle instruction.
This has some tricky parts that are hopefully addressed in the tests and their
respective comments:
1. If the shuffle mask contains an undef element, then that lane of the result is
undef:
http://llvm.org/docs/LangRef.html#shufflevector-instruction
Therefore, we can replace the constant in that lane with an undef value except
for div/rem. With div/rem, an undef in the divisor would cause the whole op to
be undef. So I'm using the same hack as in D47686 - replace the undefs with '1'.
2. Intersect the wrapping and FMF of the original binops for the new binop. There
should be no extra poison or fast-math potential in the new binop that wasn't
possible in the original code.
3. Disregard other uses. Given that we're eliminating uses (shortening the
dependency chain), I think that's always the right IR canonicalization. But
I purposely chose the udiv test to demonstrate the scenario where both
intermediate values have other uses because that seems likely worse for
codegen with an expensive math op. This seems like a very rare possibility to
me, so I don't think it requires a backend patch first.
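The constant shuffle, including the undef handling from point 1, can be modeled as follows. This is a sketch under assumptions: `Lane` and `shuffleConstants` are hypothetical, lanes hold plain ints, and the mask uses -1 for an undef element. It does not model the flag intersection from point 2.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

struct Lane {
  bool IsUndef;
  int Value;  // meaningful only when IsUndef is false
};

// Builds the constant operand of the single folded binop from the two
// original constant vectors and a "select" shuffle mask. Mask[I] is I to take
// lane I from C0, I + N to take it from C1, or -1 for an undef mask element.
std::vector<Lane> shuffleConstants(const std::vector<Lane> &C0,
                                   const std::vector<Lane> &C1,
                                   const std::vector<int> &Mask,
                                   bool IsDivOrRem) {
  const std::size_t N = C0.size();
  assert(C1.size() == N && Mask.size() == N && "vector lengths must match");
  std::vector<Lane> NewC(N);
  for (std::size_t I = 0; I != N; ++I) {
    if (Mask[I] < 0) {
      // Undef result lane: the constant may be undef too, except that an
      // undef divisor would make the whole div/rem undef, so use 1 instead.
      NewC[I] = IsDivOrRem ? Lane{false, 1} : Lane{true, 0};
      continue;
    }
    assert((Mask[I] == static_cast<int>(I) ||
            Mask[I] == static_cast<int>(I + N)) && "not a select mask");
    NewC[I] = Mask[I] < static_cast<int>(N) ? C0[I] : C1[I];
  }
  return NewC;
}
```

The result replaces both original constants, so the fold ends up as one binop of the common variable operand and the new constant vector.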
Differential Revision: https://reviews.llvm.org/D48401
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335283
91177308-0d34-0410-b5e6-
96231b3b80d8
Scott Linder [Thu, 21 Jun 2018 19:38:56 +0000 (19:38 +0000)]
[AMDGPU] Update assembler for HSA Code Object v3
Update AMDGPU assembler syntax behind the code-object-v3 feature:
* Replace/rename most AMDGPU assembler directives/symbols and document them.
* Provide more diagnostics (e.g. values out of range, missing values, repeated
values).
* Provide path for backwards compatibility, even with underlying descriptor
changes.
Differential Revision: https://reviews.llvm.org/D47736
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335281
91177308-0d34-0410-b5e6-
96231b3b80d8
Francis Visoiu Mistrih [Thu, 21 Jun 2018 19:18:36 +0000 (19:18 +0000)]
Revert r335206 "Recommit r333268: [IPSCCP] Use PredicateInfo to propagate facts from cmp instructions."
This reverts commit r335206.
As discussed here: https://reviews.llvm.org/rL333740, a fix will come
tomorrow. In the meantime, revert this to fix some bots.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335272
91177308-0d34-0410-b5e6-
96231b3b80d8
Simon Dardis [Thu, 21 Jun 2018 18:52:32 +0000 (18:52 +0000)]
[mips] Modify comment to test new email address (NFC).
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335269
91177308-0d34-0410-b5e6-
96231b3b80d8
Scott Linder [Thu, 21 Jun 2018 18:48:48 +0000 (18:48 +0000)]
[AMDGPU] Fix bug with tracking processed blocks in SIInsertWaitcnts
BlockWaitcntProcessedSet was not being cleared between calls, so it was
producing incorrect counts in cases where MBB addresses happened to coincide
across multiple calls.
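The failure mode is easy to reproduce in a model. The sketch below assumes a hypothetical `WaitcntPass` object that is reused across functions and keys its processed set on block addresses; only the clearing step reflects what this change describes.

```cpp
#include <cassert>
#include <set>
#include <vector>

struct MBB {};  // hypothetical stand-in for a machine basic block

// Hypothetical pass object that is reused for several functions.
struct WaitcntPass {
  std::set<const MBB *> Processed;

  // Returns how many blocks were processed for the first time in this run.
  int run(const std::vector<const MBB *> &Blocks) {
    // Reset per-function state. Without this, a block of a later function
    // that happens to live at the address of an earlier block would be
    // treated as already processed.
    Processed.clear();
    int NewlyProcessed = 0;
    for (const MBB *B : Blocks) {
      assert(B != nullptr && "blocks must be valid");
      if (Processed.insert(B).second)
        ++NewlyProcessed;
    }
    return NewlyProcessed;
  }
};
```

Running the pass twice over blocks at the same addresses reports them as new both times; dropping the `clear()` call would make the second run report none.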
Differential Revision: https://reviews.llvm.org/D48391
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335268
91177308-0d34-0410-b5e6-
96231b3b80d8
Konstantin Zhuravlyov [Thu, 21 Jun 2018 18:36:04 +0000 (18:36 +0000)]
AMDGPU/AMDHSA: Remove GridWorkGroupCountX/Y/Z
and everything that comes with it from implementation
and v3 header files.
Leave definition in v2 header files for backwards
compatibility.
Differential Revision: https://reviews.llvm.org/D48191
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335267
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Thu, 21 Jun 2018 18:07:38 +0000 (18:07 +0000)]
[InstCombine] add tests for shuffled cmps; NFC
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335266
91177308-0d34-0410-b5e6-
96231b3b80d8
Matt Davis [Thu, 21 Jun 2018 17:59:52 +0000 (17:59 +0000)]
[DebugInfo] Ignore DBG_VALUE instructions in PostRA Machine Sink
Summary:
The logic for handling the sinking of COPY instructions was generating
different code when building with debug flags.
The original code did not take into consideration debug instructions. This
resulted in the registers in the DBG_VALUE instructions being treated as used,
and prevented the COPY from being sunk. This patch avoids analyzing debug
instructions when trying to sink COPY instructions.
This patch also creates a routine from the code in MachineSinking::SinkInstruction to
perform the logic of sinking an instruction along with its debug instructions.
This functionality is used in multiple places, including the code for sinking COPY instrs.
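The core of the fix, skipping debug instructions when looking for register uses, can be sketched as below. `Instr` and `isRegUsed` are hypothetical simplifications with registers as plain ints; they are not the MachineSink data structures.

```cpp
#include <cassert>
#include <vector>

struct Instr {
  bool IsDebug;               // true for DBG_VALUE-like instructions
  std::vector<int> UsedRegs;  // registers read by this instruction
};

// Returns true if Reg is read by any non-debug instruction in Instrs.
bool isRegUsed(const std::vector<Instr> &Instrs, int Reg) {
  assert(Reg >= 0 && "register numbers are non-negative in this sketch");
  for (const Instr &I : Instrs) {
    if (I.IsDebug)
      continue;  // debug instructions must not influence code generation
    for (int R : I.UsedRegs)
      if (R == Reg)
        return true;
  }
  return false;
}
```

A block whose only reader of a register is a debug instruction now reports the register as unused, so the sinking decision is the same with and without debug info.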
Reviewers: junbuml, javed.absar, MatzeB, bjope
Reviewed By: bjope
Subscribers: aprantl, probinson, thegameg, jonpa, bjope, vsk, kristof.beyls, JDevlieghere, llvm-commits
Tags: #debug-info
Differential Revision: https://reviews.llvm.org/D45637
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335264
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Thu, 21 Jun 2018 17:51:44 +0000 (17:51 +0000)]
[InstCombine] use constant pattern matchers with icmp+sext
The previous code worked with vectors, but it failed when the
vector constants contained undef elements.
The matchers handle those cases.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335262
91177308-0d34-0410-b5e6-
96231b3b80d8
Sanjay Patel [Thu, 21 Jun 2018 17:37:14 +0000 (17:37 +0000)]
[InstCombine] add vector icmp tests with undefs; NFC
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@335261
91177308-0d34-0410-b5e6-
96231b3b80d8