OSDN Git Service

android-x86/external-llvm.git
7 years agoExclude more unused functions from release build.
Florian Hahn [Mon, 31 Jul 2017 16:44:28 +0000 (16:44 +0000)]
Exclude more unused functions from release build.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309576 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoExtend ifndef to printDebugLoc.
Florian Hahn [Mon, 31 Jul 2017 16:29:00 +0000 (16:29 +0000)]
Extend ifndef to printDebugLoc.

GCC7 did not warn about that, but Clang does.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309573 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoExtend ifdefs to more unused helper functions.
Florian Hahn [Mon, 31 Jul 2017 16:11:43 +0000 (16:11 +0000)]
Extend ifdefs to more unused helper functions.

This fixes a buildbot failure with -Werror introduced by r309553

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309572 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[DebugInfo] Don't overwrite DWARFUnit fields if the CU DIE doesn't have them.
Benjamin Kramer [Mon, 31 Jul 2017 15:32:39 +0000 (15:32 +0000)]
[DebugInfo] Don't overwrite DWARFUnit fields if the CU DIE doesn't have them.

DIEs are lazily deserialized so it's possible that the DWO CU is created
before the DIE is parsed. DWO shares .debug_addr and .debug_ranges with the
object file so overwriting the offset with 0 will make the CU unusable.

No test case because I couldn't get clang to emit a non-zero range base.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309570 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[docker] Fix unmatched quote problem in here-document on older versions of bash
Don Hinton [Mon, 31 Jul 2017 15:18:57 +0000 (15:18 +0000)]
[docker] Fix unmatched quote problem in here-document on older versions of bash

Summary:
When outputing usage, emit here-document directly instead of
saving in a variable first -- avoids problem with bash 3.2.57 where an
unmatched ' in the here-document results in the following error:

./build_docker_image.sh: line 135: unexpected EOF while looking for matching `''

bash --version
GNU bash, version 3.2.57(1)-release (x86_64-apple-darwin16)

Differential Revision: https://reviews.llvm.org/D36064

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309568 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[SLP] Initial rework for min/max horizontal reduction vectorization, NFC.
Alexey Bataev [Mon, 31 Jul 2017 14:36:05 +0000 (14:36 +0000)]
[SLP] Initial rework for min/max horizontal reduction vectorization, NFC.

Summary: All getReductionCost() functions are renamed to getArithmeticReductionCost() + added basic infrastructure to handle non-binary reduction operations.

Reviewers: spatel, mzolotukhin, Ayal, mkuper, gilr, hfinkel

Subscribers: RKSimon, llvm-commits

Differential Revision: https://reviews.llvm.org/D29402

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309566 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[X86] Extending a test cases for LEA factorization.
Simon Pilgrim [Mon, 31 Jul 2017 14:23:28 +0000 (14:23 +0000)]
[X86] Extending a test cases for LEA factorization.

Submitted on the behalf of Jatin Bhateja

Differential Revision: https://reviews.llvm.org/D36048

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309565 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[Cost] Rename getReductionCost() to getArithmeticReductionCost(), NFC.
Alexey Bataev [Mon, 31 Jul 2017 14:19:32 +0000 (14:19 +0000)]
[Cost] Rename getReductionCost() to getArithmeticReductionCost(), NFC.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309563 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[SelectionDAG][mips] Fix PR33883
Simon Dardis [Mon, 31 Jul 2017 14:06:58 +0000 (14:06 +0000)]
[SelectionDAG][mips] Fix PR33883

PR33883 shows that calls to intrinsic functions should not have their vector
arguments or returns subject to ABI changes required by the target.

This resolves PR33883.

Thanks to Alex Crichton for reporting the issue!

Reviewers: zoran.jovanovic, atanasyan

Differential Revision: https://reviews.llvm.org/D35765

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309561 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[LV] Avoid redundant operations manipulating masks
Ayal Zaks [Mon, 31 Jul 2017 13:21:42 +0000 (13:21 +0000)]
[LV] Avoid redundant operations manipulating masks

The Loop Vectorizer generates redundant operations when manipulating masks:
AND with true, OR with false, compare equal to true. Instead of relying on
a subsequent pass to clean them up, this patch avoids generating them.

Use null (no-mask) to represent all-one full masks, instead of a constant
all-one vector, following the convention of masked gathers and scatters.

Preparing for a follow-up VPlan patch in which these mask manipulating
operations are modeled using recipes.

Differential Revision: https://reviews.llvm.org/D35725

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309558 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[llvm-dlltool] Write correct weak externals
Martin Storsjo [Mon, 31 Jul 2017 11:18:41 +0000 (11:18 +0000)]
[llvm-dlltool] Write correct weak externals

Previously, the created object files for the import library were broken.
Write the symbol table before the string table. Simplify the code by
using a separate variable Prefix instead of duplicating a few lines.

Also update the coff-weak-exports to actually check that the generated
weak symbols can be found as intended.

Differential Revision: https://reviews.llvm.org/D36065

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309555 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoGuard print() functions only used by dump() functions.
Florian Hahn [Mon, 31 Jul 2017 10:07:49 +0000 (10:07 +0000)]
Guard print() functions only used by dump() functions.

Summary:
Since  r293359, most dump() function are only defined when
`!defined(NDEBUG) || defined(LLVM_ENABLE_DUMP)` holds. print() functions
only used by dump() functions are now unused in release builds,
generating lots of warnings. This patch only defines some print()
functions if they are used.

Reviewers: MatzeB

Reviewed By: MatzeB

Subscribers: arsenm, mzolotukhin, nhaehnle, llvm-commits

Differential Revision: https://reviews.llvm.org/D35949

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309553 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[Modules] llvm-config: Exclude CMAKE_CFG_INTDIR. It isn't used in headers.
NAKAMURA Takumi [Mon, 31 Jul 2017 10:07:13 +0000 (10:07 +0000)]
[Modules] llvm-config: Exclude CMAKE_CFG_INTDIR. It isn't used in headers.

This is part of https://reviews.llvm.org/D35559

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309552 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[Support/GlobPattern] - Do not crash when pattern has characters with int value < 0.
George Rimar [Mon, 31 Jul 2017 09:26:50 +0000 (09:26 +0000)]
[Support/GlobPattern] - Do not crash when pattern has characters with int value < 0.

Found it during work on LLD, it would crash on following
linker script:

SECTIONS { .foo : { *("*®") } }
That happens because ® has int value -82. And chars are used as
array index in code, and are signed by default.

Differential revision: https://reviews.llvm.org/D35891

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309549 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[LoopInterchange] Do not interchange loops with function calls.
Florian Hahn [Mon, 31 Jul 2017 09:00:52 +0000 (09:00 +0000)]
[LoopInterchange] Do not interchange loops with function calls.

Summary:
Without any information about the called function, we cannot be sure
that it is safe to interchange loops which contain function calls. For
example there could be dependences that prevent interchanging between
accesses in the called function and the loops. Even functions without any
parameters could cause problems, as they could access memory using
global pointers.

For now, I think it is only safe to interchange loops with calls marked
as readnone.

With this patch, the LLVM test suite passes with `-O3 -mllvm
-enable-loopinterchange` and LoopInterchangeProfitability::isProfitable
returning true for all loops. check-llvm and check-clang also pass when
bootstrapped in a similar fashion, although only 3 loops got
interchanged.

Reviewers: karthikthecool, blitz.opensource, hfinkel, mcrosier, mkuper

Reviewed By: mcrosier

Subscribers: mzolotukhin, llvm-commits

Differential Revision: https://reviews.llvm.org/D35489

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309547 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[X86][AVX512] Add masked MOVS[S|D] patterns
Guy Blank [Mon, 31 Jul 2017 08:26:14 +0000 (08:26 +0000)]
[X86][AVX512] Add masked MOVS[S|D] patterns

Added patterns to recognize AND 1 on the mask of a scalar masked
move is not needed since only the lower bit is relevant for the
instruction.

Differential Revision:
https://reviews.llvm.org/D35897

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309546 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[SLP]: Add test to resurrect the jumbled load patch. This test has multiple uses
Mohammad Shahid [Mon, 31 Jul 2017 07:40:54 +0000 (07:40 +0000)]
[SLP]: Add test to resurrect the jumbled load patch. This test has multiple uses
of memory loads by different user

Change-Id: I40b5ba8b810265440f3e55efca77c4b41ca98fa4

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309544 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[PowerPC] Change method names; NFC
Hiroshi Inoue [Mon, 31 Jul 2017 06:27:09 +0000 (06:27 +0000)]
[PowerPC] Change method names; NFC

Changed method names based on the discussion in https://reviews.llvm.org/D34986:
getInt64 -> selectI64Imm,
getInt64Count -> selectI64ImmInstrCount.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309541 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[X86] Add pattern to use bzhi for 64-bit 'and' with a mask when there is a load involved.
Craig Topper [Mon, 31 Jul 2017 05:55:54 +0000 (05:55 +0000)]
[X86] Add pattern to use bzhi for 64-bit 'and' with a mask when there is a load involved.

We already had a pattern without load, but with a load we were falling back to a regular 'and' due to pattern complexity priority.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309535 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agogold/CMakeLists.txt: Prune (-D_LARGEFILE_SOURCE -D_FILE_OFFSET_BITS=64).
NAKAMURA Takumi [Mon, 31 Jul 2017 00:39:22 +0000 (00:39 +0000)]
gold/CMakeLists.txt: Prune (-D_LARGEFILE_SOURCE -D_FILE_OFFSET_BITS=64).

They are handled in HandleLLVMOptions.cmake for -m32.
They are not required for -m64.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309532 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoPrune trailing linefeed at eof.
NAKAMURA Takumi [Mon, 31 Jul 2017 00:39:19 +0000 (00:39 +0000)]
Prune trailing linefeed at eof.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309531 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoDebugInfo: Fix r309526, ensure resetting base address selection entries are used
David Blaikie [Mon, 31 Jul 2017 00:18:24 +0000 (00:18 +0000)]
DebugInfo: Fix r309526, ensure resetting base address selection entries are used

Missed the resetting base address selections when going from a base
address version to zero base address for non-base-addressed entries.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309529 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoDebugInfo: Use base address selection entries in debug_ranges to reduce relocations
David Blaikie [Sun, 30 Jul 2017 22:10:00 +0000 (22:10 +0000)]
DebugInfo: Use base address selection entries in debug_ranges to reduce relocations

(from comments in the test)
Group ranges in a range list that apply to the same section and use a base
address selection entry to reduce the number of relocations to one reloc per
section per range list. DWARF5 debug_rnglist will be more efficient than this
in terms of relocations, but it's still better than one reloc per entry in a
range list.

This is an object/executable size tradeoff - shrinking objects, but growing
the linked executable. In one large binary tested, total object size (not just
debug info) shrank by 16%, entirely relocation entries. Linked executable
grew by 4%. This was with compressed debug info in the objects, uncompressed
in the linked executable. Without compression in the objects, the win would be
smaller (the growth of debug_ranges itself would be more significant).

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309526 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agotest: add an additional cfi_return_column test
Saleem Abdulrasool [Sun, 30 Jul 2017 21:30:54 +0000 (21:30 +0000)]
test: add an additional cfi_return_column test

Ensure that we still coalesce identical CIEs across FDEs even with
cfi_return_column alterations.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309525 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agotest: make the test clearer (NFC)
Saleem Abdulrasool [Sun, 30 Jul 2017 21:30:53 +0000 (21:30 +0000)]
test: make the test clearer (NFC)

Use `llvm-objdump -dwarf=frames` to dump the .eh_frame to validate the
output textually rather than compare the binary output.  This makes it
easier to see what is being checked.  NFC.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309524 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoNFC: spell correction.
Lama Saba [Sun, 30 Jul 2017 20:12:17 +0000 (20:12 +0000)]
NFC: spell correction.

On behalf of jbhateja

Differential Revision: https://reviews.llvm.org/D35885

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309521 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoFix typo in comment
Tobias Grosser [Sun, 30 Jul 2017 18:01:16 +0000 (18:01 +0000)]
Fix typo in comment

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309519 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agollvm-symbolizer/print_context.c test: Make debug info path independent
David Blaikie [Sun, 30 Jul 2017 17:26:34 +0000 (17:26 +0000)]
llvm-symbolizer/print_context.c test: Make debug info path independent

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309518 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agollvm-symbolizer: Make test portable using an explicit object file rather than the...
David Blaikie [Sun, 30 Jul 2017 17:16:32 +0000 (17:16 +0000)]
llvm-symbolizer: Make test portable using an explicit object file rather than the host compiler

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309517 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoMake test robust to changes in prefix/avoid hardcoded line numbers
David Blaikie [Sun, 30 Jul 2017 16:05:26 +0000 (16:05 +0000)]
Make test robust to changes in prefix/avoid hardcoded line numbers

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309516 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoRevert "[AVR] Mark a failing symbolizer test as XFAIL"
Dylan McKay [Sun, 30 Jul 2017 15:38:07 +0000 (15:38 +0000)]
Revert "[AVR] Mark a failing symbolizer test as XFAIL"

This reverts commit 83a0e876349adb646ba858eb177b22b0b4bfc59a.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309515 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoDebugInfo: Fix for CU index usage in 309507
David Blaikie [Sun, 30 Jul 2017 15:15:58 +0000 (15:15 +0000)]
DebugInfo: Fix for CU index usage in 309507

Not sure quite how I failed so clearly to test this, but anyway.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309514 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[AVR] Mark a failing symbolizer test as XFAIL
Dylan McKay [Sun, 30 Jul 2017 14:55:11 +0000 (14:55 +0000)]
[AVR] Mark a failing symbolizer test as XFAIL

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309512 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoExpanding the test case for vf8 for stride 4 interleaved.
Michael Zuckerman [Sun, 30 Jul 2017 11:54:57 +0000 (11:54 +0000)]
Expanding the test case for vf8 for stride 4 interleaved.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309511 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[x86][inline-asm][ms-compat] legalize the use of "jc/jz short <op>"
Coby Tayree [Sun, 30 Jul 2017 11:12:47 +0000 (11:12 +0000)]
[x86][inline-asm][ms-compat] legalize the use of "jc/jz short <op>"

MS ignores the keyword "short" when used after a jc/jz instruction, LLVM ought to do the same.
Test: D35893

Differential Revision: https://reviews.llvm.org/D35892

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309509 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoDebugInfo: Use DWP cu_index to speed up symbolizing (as intended)
David Blaikie [Sun, 30 Jul 2017 08:12:07 +0000 (08:12 +0000)]
DebugInfo: Use DWP cu_index to speed up symbolizing (as intended)

I was a bit lazy when I first implemented this & skipped the index
lookup - obviously for large files this becomes pretty crucial, so here
we go, do the index lookup. Speeds up large DWP symbolizing by... lots.
(20m -> 20s, actually, maybe more in a release build (that was a release
build without index lookup, compared to a debug/non-release build with
the index usage))

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309507 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoDebugInfo: Group member variable along with the rest
David Blaikie [Sun, 30 Jul 2017 08:12:05 +0000 (08:12 +0000)]
DebugInfo: Group member variable along with the rest

Committed in r309498 I didn't spot where the rest of the private members
were in DWARFContext at the time - group them up again.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309506 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[X86] Add addsub intrinsics to the intrinsic lowering table so we have a single set...
Craig Topper [Sun, 30 Jul 2017 06:02:59 +0000 (06:02 +0000)]
[X86] Add addsub intrinsics to the intrinsic lowering table so we have a single set of isel patterns.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309502 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoRefactor the build{Module|Function}SimplificationPipeline to expose optimization...
Dehao Chen [Sun, 30 Jul 2017 04:55:39 +0000 (04:55 +0000)]
Refactor the build{Module|Function}SimplificationPipeline to expose optimization phase.

Summary: This is in preparation of https://reviews.llvm.org/D36052

Reviewers: chandlerc, davidxl, tejohnson

Reviewed By: chandlerc

Subscribers: sanjoy, llvm-commits

Differential Revision: https://reviews.llvm.org/D36053

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309500 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoDebugInfo: Provide option for explicitly specifying the name of the DWP file
David Blaikie [Sun, 30 Jul 2017 01:34:08 +0000 (01:34 +0000)]
DebugInfo: Provide option for explicitly specifying the name of the DWP file

If you've archived the DWP file somewhere it's probably useful to be
able to just tell llvm-symbolizer where it is when you're symbolizing
stack traces from the binary.

This only provides a mechanism for specifying a single DWP file, good if
you're symbolizing a program with a single DWP file, but it's likely if
the program is dynamically linked that you might have a DWP for each
dynamic library - in which case this feature won't help (at least as
it's surfaced in llvm-symbolizer for now) - in theory it could be
extended to specify a collection of DWP files that could all be
consulted for split CU hash resolution.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309498 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoMigrate PGOMemOptSizeOpt to use new OptimizationRemarkEmitter Pass
Sam Elliott [Sun, 30 Jul 2017 00:35:33 +0000 (00:35 +0000)]
Migrate PGOMemOptSizeOpt to use new OptimizationRemarkEmitter Pass

Summary:
Fixes PR33790.

This patch still needs a yaml-style test, which I shall write tomorrow

Reviewers: anemet

Reviewed By: anemet

Subscribers: anemet, llvm-commits

Differential Revision: https://reviews.llvm.org/D35981

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309497 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[AArch64] Tie source and destination operands for AESMC/AESIMC.
Florian Hahn [Sat, 29 Jul 2017 20:35:28 +0000 (20:35 +0000)]
[AArch64] Tie source and destination operands for AESMC/AESIMC.

Summary:
Most CPUs implementing AES fusion require instruction pairs of the form
    AESE Vn, _
    AESMC Vn, Vn
and
    AESD Vn, _
    AESIMC Vn, Vn

The constraint is added to AES(I)MC instructions which use the result of
an AES(E|D) instruction by using AES(I)MCTrr pseudo instructions, which
constraint source and destination registers to be the same.

A nice side effect of this change is that now all possible pairs are
scheduled back-to-back on the exynos-m1 for the misched-fusion-aes.ll
test case.

I had to update aes_load_store. The version I added initially was very
reduced and with the new constraint, AESE/AESMC could not be scheduled
back-to-back. I updated the test to be more realistic and still expose
the same scheduling problem as the initial test case.

Reviewers: t.p.northover, rengolin, evandro, kristof.beyls, silviu.baranga

Reviewed By: t.p.northover, evandro

Subscribers: aemerson, javed.absar, llvm-commits

Differential Revision: https://reviews.llvm.org/D35299

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309495 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[AArch64] Use 8 bytes as preferred function alignment on Cortex-A53.
Florian Hahn [Sat, 29 Jul 2017 20:04:54 +0000 (20:04 +0000)]
[AArch64] Use 8 bytes as preferred function alignment on Cortex-A53.

Summary:
This change gives a 0.25% speedup on execution time, a 0.82% improvement
in benchmark scores and a 0.20% increase in binary size on a Cortex-A53.
These numbers are the geomean results on a wide range of benchmarks from
the test-suite and a range of proprietary suites.

Reviewers: t.p.northover, aadg, silviu.baranga, mcrosier, rengolin

Reviewed By: rengolin

Subscribers: grimar, davide, aemerson, rengolin, javed.absar, kristof.beyls, llvm-commits

Differential Revision: https://reviews.llvm.org/D35568

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309494 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoMC: simplify internal function call parameter
Saleem Abdulrasool [Sat, 29 Jul 2017 20:03:02 +0000 (20:03 +0000)]
MC: simplify internal function call parameter

Rather than passing along most of the parameters, pass a reference to
the MCDWARFrameInfo instead.  This makes it easier to pass additional
information about the frame to the checks.  We need to keep the extra
constructor for the Key around to allow the construction of the null and
tombstone keys.  NFC.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309493 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoMC: account for the return column in the CIE key
Saleem Abdulrasool [Sat, 29 Jul 2017 20:03:00 +0000 (20:03 +0000)]
MC: account for the return column in the CIE key

If the return column is different, we cannot coalesce the CIE across the
FDEs.  Add that to the key calculation.  This ensures that we emit a
separate CIE.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309492 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoFix test failure without X86 backend
Hiroshi Inoue [Sat, 29 Jul 2017 15:03:31 +0000 (15:03 +0000)]
Fix test failure without X86 backend

move test/Transforms/SimplifyCFG/disable-lookup-table.ll into test/Transforms/SimplifyCFG/X86/disable-lookup-table.ll to avoid test failure when X86 backend is not enabled

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309487 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[SelectionDAG][X86] CombineBT - more aggressively determine demanded bits
Simon Pilgrim [Sat, 29 Jul 2017 14:50:25 +0000 (14:50 +0000)]
[SelectionDAG][X86] CombineBT - more aggressively determine demanded bits

This patch is in 2 parts:

1 - replace combineBT's use of SimplifyDemandedBits (hasOneUse only) with SelectionDAG::GetDemandedBits to more aggressively determine the lower bits used by BT.

2 - update SelectionDAG::GetDemandedBits to support ANY_EXTEND - if the demanded bits are only in the non-extended portion, then peek through and demand from the source value and then ANY_EXTEND that if we found a match.

Differential Revision: https://reviews.llvm.org/D35896

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309486 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[tests] Do not emity binary bitcode to stdout in RegionInfo tests
Tobias Grosser [Sat, 29 Jul 2017 09:58:43 +0000 (09:58 +0000)]
[tests] Do not emity binary bitcode to stdout in RegionInfo tests

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309485 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[OCaml] Pass -D/-UNDEBUG through to ocamlc
Michal Gorny [Sat, 29 Jul 2017 08:10:24 +0000 (08:10 +0000)]
[OCaml] Pass -D/-UNDEBUG through to ocamlc

Detect [/-][DU]NDEBUG in CMAKE_C_FLAGS* and pass them through to ocamlc.
This is necessary because their value might affect visibility of dump
functions in LLVM and ocamlc uses its own compiler and flags by default.

Differential Revision: https://reviews.llvm.org/D35898

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309483 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoUpdate the test to make windows bot pass.
Dehao Chen [Sat, 29 Jul 2017 07:01:25 +0000 (07:01 +0000)]
Update the test to make windows bot pass.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309482 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[OCaml] Install dynamic libraries in 'stubdirs' directory
Michal Gorny [Sat, 29 Jul 2017 06:46:45 +0000 (06:46 +0000)]
[OCaml] Install dynamic libraries in 'stubdirs' directory

Install the OCaml dynamic libraries in the 'stubdirs' directory rather
than the llvm subdirectory in order to fix running executables created
by ocamlc. Otherwise, the executables fail to run being unable to locate
the libraries (unless the LLVM directory is explicitly added to
LD_LIBRARY_PATH).

The staging directories are not altered since they work for our
development setup anyway, and installing into two directories would
unnecessarily make the code more complex.

Differential Revision: https://reviews.llvm.org/D35995

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309481 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[SCEV] Change an early exit to an assert; NFC
Sanjoy Das [Sat, 29 Jul 2017 05:32:47 +0000 (05:32 +0000)]
[SCEV] Change an early exit to an assert; NFC

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309480 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoupdate the test file that was omitted in r309478.
Dehao Chen [Sat, 29 Jul 2017 04:11:20 +0000 (04:11 +0000)]
update the test file that was omitted in r309478.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309479 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoRefine the PGOOpt and SamplePGOSupport handling.
Dehao Chen [Sat, 29 Jul 2017 04:10:24 +0000 (04:10 +0000)]
Refine the PGOOpt and SamplePGOSupport handling.

Summary:
Now that SamplePGOSupport is part of PGOOpt, there are several places that need tweaking:
1. AddDiscriminator pass should *not* be invoked at ThinLTOBackend (as it's already invoked in the PreLink phase)
2. addPGOInstrPasses should only be invoked when either ProfileGenFile or ProfileUseFile is non-empty.
3. SampleProfileLoaderPass should only be invoked when SampleProfileFile is non-empty.
4. PGOIndirectCallPromotion should only be invoked in ProfileUse phase, or in ThinLTOBackend of SamplePGO.

Reviewers: chandlerc, tejohnson, davidxl

Reviewed By: chandlerc

Subscribers: sanjoy, mehdi_amini, eraman, llvm-commits

Differential Revision: https://reviews.llvm.org/D36040

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309478 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoAMDGPU: Remove deadcode from AMDGPUInstPrinter
Tom Stellard [Sat, 29 Jul 2017 03:56:53 +0000 (03:56 +0000)]
AMDGPU: Remove deadcode from AMDGPUInstPrinter

Reviewers: arsenm

Reviewed By: arsenm

Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, llvm-commits, t-tye

Differential Revision: https://reviews.llvm.org/D36034

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309477 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoAMDGPU: Move INDIRECT_BASE_ADDR definition out of common files
Tom Stellard [Sat, 29 Jul 2017 03:44:07 +0000 (03:44 +0000)]
AMDGPU: Move INDIRECT_BASE_ADDR definition out of common files

Summary: This is only used by R600.

Reviewers: arsenm

Reviewed By: arsenm

Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, llvm-commits, t-tye

Differential Revision: https://reviews.llvm.org/D35926

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309476 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[MachineOutliner] NFC: Change IsTailCall to a call class + frame class
Jessica Paquette [Sat, 29 Jul 2017 02:55:46 +0000 (02:55 +0000)]
[MachineOutliner] NFC: Change IsTailCall to a call class + frame class

This commit

- Removes IsTailCall and replaces it with a target-defined unsigned
- Refactors getOutliningCallOverhead and getOutliningFrameOverhead so that they don't use IsTailCall
- Adds a call class + frame class classification to OutlinedFunction and Candidate respectively

This accomplishes a couple things.

Firstly, we don't need the notion of *tail call* in the general outlining algorithm.

Secondly, we now can have different "outlining classes" for each candidate within a set of candidates.
This will make it easy to add new ways to outline sequences for certain targets and dynamically choose
an appropriate cost model for a sequence depending on the context that that sequence lives in.

Ultimately, this should get us closer to being able to do something like, say avoid saving the link
register when outlining AArch64 instructions.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309475 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agolit::shtest-format.py: Make write-bad-encoding.py py3-aware.
NAKAMURA Takumi [Sat, 29 Jul 2017 02:52:56 +0000 (02:52 +0000)]
lit::shtest-format.py: Make write-bad-encoding.py py3-aware.

  Traceback (most recent call last):
    File "llvm/utils/lit/tests/Inputs/shtest-format/external_shell/write-bad-encoding.py", line 5, in <module>
      sys.stdout.write(b"a line with bad encoding: \xc2.")

sys.stdout.write doesn't accept bytes but sys.stdout.buffer.write accepts.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309473 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoAMDGPU: Make areMemAccessesTriviallyDisjoint more aware of segment flat
Matt Arsenault [Sat, 29 Jul 2017 01:26:21 +0000 (01:26 +0000)]
AMDGPU: Make areMemAccessesTriviallyDisjoint more aware of segment flat

Checking the encoding is insufficient since now there can
be global or scratch instructions.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309472 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoAMDGPU: Teach isLegalAddressingMode about global_* instructions
Matt Arsenault [Sat, 29 Jul 2017 01:12:31 +0000 (01:12 +0000)]
AMDGPU: Teach isLegalAddressingMode about global_* instructions

Also refine the flat check to respect flat-for-global feature,
and constant fallback should check global handling, not
specifically MUBUF.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309471 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoAMDGPU: Start selecting global instructions
Matt Arsenault [Sat, 29 Jul 2017 01:03:53 +0000 (01:03 +0000)]
AMDGPU: Start selecting global instructions

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309470 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[Hexagon] Fix some Clang-tidy modernize-use-using and Include What You Use warnings...
Eugene Zelenko [Sat, 29 Jul 2017 00:56:56 +0000 (00:56 +0000)]
[Hexagon] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC).

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309469 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[llvm] Update MachOObjectFile::exports interface
Alexander Shaposhnikov [Sat, 29 Jul 2017 00:30:45 +0000 (00:30 +0000)]
[llvm] Update MachOObjectFile::exports interface

This diff removes the second argument of the method MachOObjectFile::exports.
In all in-tree uses this argument is equal to "this" and
without this argument the interface seems to be cleaner.

Test plan: make check-all

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309462 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoFix update_llc_test_checks.py ARM parsing
Eli Friedman [Fri, 28 Jul 2017 23:58:24 +0000 (23:58 +0000)]
Fix update_llc_test_checks.py ARM parsing

When I tried running the script, the ARM regex parser could not parse
my code. It failed because the .Lfunc_end line has a comment at the
end of it, so this commit removes the newline at the end of the regex.

Patch by Joel Galenson!

Differential Revision: https://reviews.llvm.org/D35641

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309457 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[LTO] llvm-lto2: Add option to load sample profile
Tobias Edler von Koch [Fri, 28 Jul 2017 23:43:22 +0000 (23:43 +0000)]
[LTO] llvm-lto2: Add option to load sample profile

Summary:
This exposes LTO's Conf.SampleProfile as a command line option
(-lto-sample-profile-file) for testing via the llvm-lto2 utility.

Reviewers: pcc, danielcdh

Subscribers: mehdi_amini, inglorion, llvm-commits

Differential Revision: https://reviews.llvm.org/D36030

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309456 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoRemove the unused offset field from LiveDebugValues (NFC)
Adrian Prantl [Fri, 28 Jul 2017 23:25:51 +0000 (23:25 +0000)]
Remove the unused offset field from LiveDebugValues (NFC)

Followup to r309426.
rdar://problem/33580047

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309455 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoRemove the unused offset field from LiveDebugVariables (NFC)
Adrian Prantl [Fri, 28 Jul 2017 23:06:50 +0000 (23:06 +0000)]
Remove the unused offset field from LiveDebugVariables (NFC)

Followup to r309426.
rdar://problem/33580047

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309451 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoRemove the unused offset from DBG_VALUE (NFC)
Adrian Prantl [Fri, 28 Jul 2017 23:00:45 +0000 (23:00 +0000)]
Remove the unused offset from DBG_VALUE (NFC)

Followup to r309426.
rdar://problem/33580047

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309450 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoRemove the unused DBG_VALUE offset parameter from GlobalISel (NFC)
Adrian Prantl [Fri, 28 Jul 2017 22:46:20 +0000 (22:46 +0000)]
Remove the unused DBG_VALUE offset parameter from GlobalISel (NFC)

Followup to r309426.
rdar://problem/33580047

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309449 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoUpdate the Go bindings for r309426 (remove offset from llvm.dbg.value)
Adrian Prantl [Fri, 28 Jul 2017 22:44:44 +0000 (22:44 +0000)]
Update the Go bindings for r309426 (remove offset from llvm.dbg.value)

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309448 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoAdded tests for i8 interleaved-load-pattern of stride=4, VF=(8, 16, 32).
Farhana Aleen [Fri, 28 Jul 2017 22:43:34 +0000 (22:43 +0000)]
Added tests for i8 interleaved-load-pattern of stride=4, VF=(8, 16, 32).

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309447 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoRemove the unused DBG_VALUE offset parameter from RegAllocFast (NFC)
Adrian Prantl [Fri, 28 Jul 2017 22:36:55 +0000 (22:36 +0000)]
Remove the unused DBG_VALUE offset parameter from RegAllocFast (NFC)

Followup to r309426.
rdar://problem/33580047

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309446 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoAdd documentation for the attribute "no-jump-tables"
Sumanth Gundapaneni [Fri, 28 Jul 2017 22:26:22 +0000 (22:26 +0000)]
Add documentation for the attribute "no-jump-tables"

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309445 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[SimplifyCFG] Make the no-jump-tables attribute also disable switch lookup tables
Sumanth Gundapaneni [Fri, 28 Jul 2017 22:25:40 +0000 (22:25 +0000)]
[SimplifyCFG] Make the no-jump-tables attribute also disable switch lookup tables

Differential Revision: https://reviews.llvm.org/D35579

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309444 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[libFuzzer] improve support for inline-8bit-counters (make it more correct and faster)
Kostya Serebryany [Fri, 28 Jul 2017 22:00:56 +0000 (22:00 +0000)]
[libFuzzer] improve support for inline-8bit-counters (make it more correct and faster)

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309443 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[Hexagon] Formatting changes, NFC
Krzysztof Parzyszek [Fri, 28 Jul 2017 21:52:21 +0000 (21:52 +0000)]
[Hexagon] Formatting changes, NFC

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309442 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[Inliner] Do not apply any bonus for cold callsites.
Easwaran Raman [Fri, 28 Jul 2017 21:47:36 +0000 (21:47 +0000)]
[Inliner] Do not apply any bonus for cold callsites.

Summary:
Inlining threshold is increased by application of bonuses when the
callee has a single reachable basic block or is rich in vector
instructions. Similarly, inlining cost is reduced by applying a large
bonus when the last call to a static function is considered for
inlining. This patch disables the application of these bonuses when the
callsite or the callee is cold. The intention here is to prevent a large
cold callsite from being inlined to a non-cold caller that could prevent
the caller from being inlined. This is especially important when the
cold callsite is a last call to a static since the associated bonus is
very high.

Reviewers: chandlerc, davidxl

Subscribers: danielcdh, llvm-commits

Differential Revision: https://reviews.llvm.org/D35823

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309441 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoRemove the unused dbg.value offset from SelectionDAG (NFC)
Adrian Prantl [Fri, 28 Jul 2017 21:27:35 +0000 (21:27 +0000)]
Remove the unused dbg.value offset from SelectionDAG (NFC)

Followup to r309426.
rdar://problem/33580047

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309436 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[lit] Use a %{python} substitution to avoid relying on python being on PATH
Reid Kleckner [Fri, 28 Jul 2017 21:13:47 +0000 (21:13 +0000)]
[lit] Use a %{python} substitution to avoid relying on python being on PATH

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309434 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[lit] Remove stale test inputs before running check-lit
Reid Kleckner [Fri, 28 Jul 2017 21:00:57 +0000 (21:00 +0000)]
[lit] Remove stale test inputs before running check-lit

This should fix googletest-format test failures on the clang modules
buildbots, which have a stale copy of the OneTest script in the build
directory.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309432 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoReword sentence in LangRef
Adrian Prantl [Fri, 28 Jul 2017 20:44:29 +0000 (20:44 +0000)]
Reword sentence in LangRef

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309431 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoRemove the obsolete offset parameter from @llvm.dbg.value
Adrian Prantl [Fri, 28 Jul 2017 20:21:02 +0000 (20:21 +0000)]
Remove the obsolete offset parameter from @llvm.dbg.value

There is no situation where this rarely-used argument cannot be
substituted with a DIExpression and removing it allows us to simplify
the DWARF backend. Note that this patch does not yet remove any of
the newly dead code.

rdar://problem/33580047
Differential Revision: https://reviews.llvm.org/D35951

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309426 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[SLP] Allow vectorization of the instruction from the same basic blocks only, NFC.
Alexey Bataev [Fri, 28 Jul 2017 20:11:16 +0000 (20:11 +0000)]
[SLP] Allow vectorization of the instruction from the same basic blocks only, NFC.

Summary:
After some changes in SLP vectorizer we missed some additional checks to
limit the instructions for vectorization. We should not perform analysis
of the instructions if the parent of instruction is not the same as the
parent of the first instruction in the tree or it was analyzed already.

Subscribers: mzolotukhin

Differential Revision: https://reviews.llvm.org/D34881

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309425 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoFix conditional tail call branch folding when both edges are the same
Reid Kleckner [Fri, 28 Jul 2017 19:48:40 +0000 (19:48 +0000)]
Fix conditional tail call branch folding when both edges are the same

The conditional tail call logic did the wrong thing when both
destinations of a conditional branch were the same:

BB#1: derived from LLVM BB %entry
    Live Ins: %EFLAGS
    Predecessors according to CFG: BB#0
        JE_1 <BB#5>, %EFLAGS<imp-use,kill>
        JMP_1 <BB#5>

BB#5: derived from LLVM BB %sw.epilog
    Predecessors according to CFG: BB#1
        TCRETURNdi64 <ga:@mergeable_conditional_tailcall>, 0, ...

We would fold the JE_1 to a TCRETURNdi64cc, and then remove our BB#5
successor. Then BB#5 would be deleted as it had no predecessors, leaving
a dangling "JMP_1 <BB#5>" reference behind to cause assertions later.

This patch checks that both conditional branch destinations are
different before doing the transform. The standard branch folding logic
is able to remove both the JMP_1 and the JE_1, and for my test case we
end up forming a better conditional tail call later.

Fixes PR33980

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309422 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoAMDGPU: Look through a bitcast user of an out argument
Matt Arsenault [Fri, 28 Jul 2017 19:06:16 +0000 (19:06 +0000)]
AMDGPU: Look through a bitcast user of an out argument

This allows handling of a lot more of the interesting
cases in Blender. Most of the large functions unlikely
to be inlined have this pattern.

This is a special case for what clang emits for OpenCL 3
element vectors. Annoyingly, these are emitted as
<3 x elt>* pointers, but accessed as <4 x elt>* operations.
This also needs to handle cases where a struct containing
a single vector is used.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309419 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[Value Tracking] Refactor icmp comparison logic into helper. NFC.
Chad Rosier [Fri, 28 Jul 2017 18:47:43 +0000 (18:47 +0000)]
[Value Tracking] Refactor icmp comparison logic into helper. NFC.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309417 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoAMDGPU: Add pass to replace out arguments
Matt Arsenault [Fri, 28 Jul 2017 18:40:05 +0000 (18:40 +0000)]
AMDGPU: Add pass to replace out arguments

It is better to return arguments directly in registers
if we are making a call rather than introducing expensive
stack usage. In one of sample compile from one of
Blender's many kernel variants, this fires on about
~20 different functions. Future improvements may be to
recognize simple cases where the pointer is indexing a small
array. This also fails when the store to the out argument
is in a separate block from the return, which happens in
a few of the Blender functions. This should also probably
be using MemorySSA which might help with that.

I'm not sure this is correct as a FunctionPass, but
MemoryDependenceAnalysis seems to not work with
a ModulePass.

I'm also not sure where it should run.I think it should
run  before DeadArgumentElimination, so maybe either
EP_CGSCCOptimizerLate or EP_ScalarOptimizerLate.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309416 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[LVI] Constant-propagate a zero extension of the switch condition value through case...
Hiroshi Yamauchi [Fri, 28 Jul 2017 18:35:25 +0000 (18:35 +0000)]
[LVI] Constant-propagate a zero extension of the switch condition value through case edges

Summary:
LazyValueInfo currently computes the constant value of the switch condition through case edges, which allows the constant value to be propagated through the case edges.

But we have seen a case where a zero-extended value of the switch condition is used past case edges for which the constant propagation doesn't occur.

This patch adds a small logic to handle such a case in getEdgeValueLocal().

This is motivated by the Python 2.7 eval loop in PyEval_EvalFrameEx() where the lack of the constant propagation causes longer live ranges and more spill code than necessary.

With this patch, we see that the code size of PyEval_EvalFrameEx() decreases by ~5.4% and a performance test improves by ~4.6%.

Reviewers: wmi, dberlin, sanjoy

Reviewed By: sanjoy

Subscribers: davide, davidxl, llvm-commits

Differential Revision: https://reviews.llvm.org/D34822

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309415 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoGlobalISel: map 128-bit values to an FPR by default.
Tim Northover [Fri, 28 Jul 2017 17:11:01 +0000 (17:11 +0000)]
GlobalISel: map 128-bit values to an FPR by default.

Eventually we may want to allow a pair of GPRs but absolutely nothing in the
entire world is ready for that yet.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309404 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[lit] Dump some FileCheck inputs to try to debug some failing tests
Reid Kleckner [Fri, 28 Jul 2017 16:24:18 +0000 (16:24 +0000)]
[lit] Dump some FileCheck inputs to try to debug some failing tests

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309400 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[lit] Fix shtest-format external_shell failures
Reid Kleckner [Fri, 28 Jul 2017 16:13:02 +0000 (16:13 +0000)]
[lit] Fix shtest-format external_shell failures

When using win32 cmd.exe, turn off command echoing at the beginning of
the script (@echo off).

Replace a bash shell script with a python script for the
fail_with_bad_encoding test.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309399 91177308-0d34-0410-b5e6-96231b3b80d8

7 years agoAMDGPU: Annotate implicitarg.ptr usage
Matt Arsenault [Fri, 28 Jul 2017 15:52:08 +0000 (15:52 +0000)]
AMDGPU: Annotate implicitarg.ptr usage

We need to pass something to functions for this to work.
It isn't derivable just from the kernarg segment pointer
because the implicit arguments are placed after the
kernel arguments.

Also fixes missing test for the intrinsic.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309398 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[GVN] Recommit the patch "Add phi-translate support in scalarpre"
Wei Mi [Fri, 28 Jul 2017 15:47:25 +0000 (15:47 +0000)]
[GVN] Recommit the patch "Add phi-translate support in scalarpre"

Recommit after workaround the bug PR31652.

Three bugs fixed in previous recommits: The first one is to use CurrentBlock
instead of PREInstr's Parent as param of performScalarPREInsertion because
the Parent of a clone instruction may be uninitialized. The second one is stop
PRE when CurrentBlock to its predecessor is a backedge and an operand of CurInst
is defined inside of CurrentBlock. The same value defined inside of loop in last
iteration can not be regarded as available. The third one is an out-of-bound
array access in a flipped if guard.

Right now scalarpre doesn't have phi-translate support, so it will miss some
simple pre opportunities. Like the following testcase, current scalarpre cannot
recognize the last "a * b" is fully redundent because a and b used by the last
"a * b" expr are both defined by phis.

long a[100], b[100], g1, g2, g3;
__attribute__((pure)) long goo();

void foo(long a, long b, long c, long d) {

  g1 = a * b;
  if (__builtin_expect(g2 > 3, 0)) {
    a = c;
    b = d;
    g2 = a * b;
  }
  g3 = a * b;      // fully redundant.

}

The patch adds phi-translate support in scalarpre. This is only a temporary
solution before the newpre based on newgvn is available.

Differential Revision: https://reviews.llvm.org/D32252

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309397 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[CMake] NFC. Add intrinsics_gen target to CMake Exports
Chris Bieneman [Fri, 28 Jul 2017 15:33:35 +0000 (15:33 +0000)]
[CMake]  NFC. Add intrinsics_gen target to CMake Exports

By creating a dummy of this target in LLVMConfig.cmake, projects that can build against out-of-tree LLVM can freely depend on the target without needing to have conditionals for if LLVM is in-tree or out-of-tree.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309389 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[ValueTracking] Remove a number of unused arguments. NFC.
Chad Rosier [Fri, 28 Jul 2017 14:39:06 +0000 (14:39 +0000)]
[ValueTracking] Remove a number of unused arguments. NFC.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309385 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[AArch64] Standardize suffixes for LSE Atomics mnemonics (NFCI)
Joel Jones [Fri, 28 Jul 2017 14:09:24 +0000 (14:09 +0000)]
[AArch64] Standardize suffixes for LSE Atomics mnemonics (NFCI)

This NFC changeset standardizes the suffixes used for LSE Atomics
instructions.

It changes the existing suffixes - 'b', 'h', 's', 'd' - to the existing
standard 'B', 'H', 'W' and 'X'.

This changeset is the result of the code review discussion for D35319.

Patch by: steleman

Differential Revision: https://reviews.llvm.org/D35927

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309384 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[ARM] Add the option to directly access TLS pointer
Strahinja Petrovic [Fri, 28 Jul 2017 12:54:57 +0000 (12:54 +0000)]
[ARM] Add the option to directly access TLS pointer

This patch enables choice for accessing thread local
storage pointer (like '-mtp' in gcc).

Differential Revision: https://reviews.llvm.org/D34408

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309381 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[X86] Add test case for PR33290
Simon Pilgrim [Fri, 28 Jul 2017 09:43:52 +0000 (09:43 +0000)]
[X86] Add test case for PR33290

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309375 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[X86][AVX] Cleanup shuffle combine tests - remove old prefixes.
Simon Pilgrim [Fri, 28 Jul 2017 09:41:55 +0000 (09:41 +0000)]
[X86][AVX] Cleanup shuffle combine tests - remove old prefixes.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309374 91177308-0d34-0410-b5e6-96231b3b80d8

7 years ago[ARM] Add test to check pcs of ARM ABI runtime floating point helpers
Peter Smith [Fri, 28 Jul 2017 09:21:00 +0000 (09:21 +0000)]
[ARM] Add test to check pcs of ARM ABI runtime floating point helpers

The ARM Runtime ABI document (IHI0043) defines the AEABI floating point
helper functions in section 4.1.2 The floating-point helper functions.
The functions listed in this section must always use the base AAPCS calling
convention.

This test generates calls to all the helper functions that llvm supports
and checks that the base AAPCS calling convention has been used. We test
the equivalent of -mfloat-abi=soft, -mfloat-abi=softfp, -mfloat-abi=hardfp
with an FPU that supports single and double precision, and one that only
supports double precision.

Differential Revision: https://reviews.llvm.org/D35904

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@309371 91177308-0d34-0410-b5e6-96231b3b80d8