Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
1389 commits
Select commit Hold shift + click to select a range
9a37f35
ARROW-12108: [Rust] [DataFusion] Implement SHOW TABLES
alamb Mar 31, 2021
af4d5f4
ARROW-12028 ARROW-11940: [Rust][DataFusion] Add TimestampMillisecond …
Mar 31, 2021
5bfae36
ARROW-12165: [Rust] inline append functions of builders
ritchie46 Mar 31, 2021
ab435da
ARROW-12143: [CI] R builds should timeout and fail after some thresho…
westonpace Mar 31, 2021
5216e04
PARQUET-1990: [C++] Refuse to write ConvertedType::NA
pitrou Apr 1, 2021
971a9d3
ARROW-12110: [Java] Implement ZSTD compression
Apr 1, 2021
223a067
ARROW-11858: [GLib][Gandiva] Add Gandiva::Filter and related functions
kou Apr 1, 2021
cc3bcf8
ARROW-12171: [Rust] clean up clippy lints
alamb Apr 1, 2021
caa8950
ARROW-12107: [Rust][DataFusion] Support `SELECT * from information_sc…
alamb Apr 1, 2021
1757ba2
ARROW-12160: [Rust] Add `into_inner()` to StreamWriter
ericwburden Apr 1, 2021
090f11c
ARROW-10354: [Rust][DataFusion] regexp_extract function to select reg…
sweb Apr 1, 2021
24e3b7f
ARROW-12097: [C++] Modify BackgroundGenerator so it creates fewer thr…
westonpace Apr 1, 2021
876d579
ARROW-12068: [Python] Stop using distutils
pitrou Apr 1, 2021
938ac4f
ARROW-12169: [C++] Fix decompressing file with empty stream at the end
pitrou Apr 1, 2021
6e29200
ARROW-12000: [Documentation] Add note about deviation from style guid…
westonpace Apr 1, 2021
3679132
ARROW-12173: [GLib] Remove #include <config.h>
kou Apr 2, 2021
2f3ed3a
ARROW-12052: [Rust] Add Child Data to Arrow's C FFI implementation. …
ritchie46 Apr 2, 2021
3d87a0e
ARROW-12161: [C++] Async streaming CSV reader deadlocking when being …
westonpace Apr 2, 2021
2b87dfc
ARROW-12112: [Rust] Create and store less debug information in CI and…
alamb Apr 3, 2021
ebe4295
ARROW-12194: [Rust][Parquet] Bump zstd to v0.7
aldanor Apr 4, 2021
beb1c1b
ARROW-12186: [Rust][DataFusion] Fix regexp_match test
seddonm1 Apr 4, 2021
5cabd31
ARROW-12034: [Developer Tools] Formalize Minor PRs
emkornfield Apr 4, 2021
fb44622
ARROW-12193: [Dev][Release] Use downloadable URL for archive download
kou Apr 4, 2021
4f9a94b
ARROW-12155: [R] Require Table columns to be same length
ianmcook Apr 5, 2021
6db1183
ARROW-11822: [Rust][Datafusion] Support case sensitive comparisons fo…
alamb Apr 5, 2021
0fdbc94
ARROW-12176: [C++] Fix some typos of cpp examples
Clcanny Apr 6, 2021
3f19b15
ARROW-11513: [R] Bindings for sub/gsub
ianmcook Apr 6, 2021
19718f8
ARROW-11338: [R] Bindings for quantile and median
ianmcook Apr 6, 2021
b22f50d
ARROW-12172: [Python][Packaging] Pass python version as setuptools pr…
kszucs Apr 6, 2021
fc1e54e
ARROW-12214: [Rust][DataFusion] Add tests for limit
alamb Apr 6, 2021
dd8cd10
ARROW-12204: [Rust][CI] Reduce size of Rust build artifacts in integr…
alamb Apr 6, 2021
0de0de7
ARROW-12210: [Rust][DataFusion] Document SHOW TABLES / SHOW COLUMNS /…
alamb Apr 6, 2021
3e825a7
ARROW-12109: [Rust][DataFusion] Implement SHOW COLUMNS
alamb Apr 6, 2021
3274d08
ARROW-10882: [Python] Allow writing dataset from iterator of batches
lidavidm Apr 6, 2021
e2c22a1
ARROW-12187: [C++][FlightRPC] Add compression benchmark for stream wr…
cyb70289 Apr 6, 2021
81f6521
ARROW-7364: [Rust][DataFusion] Add cast options to cast kernel and TR…
seddonm1 Apr 6, 2021
8296bdc
ARROW-12178: [CI] Update setuptools in the ubuntu images
kszucs Apr 6, 2021
67bf0ab
ARROW-12134: [C++] Add match_substring_regex kernel
lidavidm Apr 6, 2021
2132bb2
ARROW-12206: [Python][Docs] Fix Table docstrings
albertvillanova Apr 6, 2021
a111fc9
ARROW-12226: [C++] Fix Address Sanitizer failures
pitrou Apr 6, 2021
ace2bfc
ARROW-11478: [R] Consider ways to make arrow.skip_nul option more use…
nealrichardson Apr 6, 2021
538e13b
ARROW-12227: [R] Fix RE2 and median nightly build failures
ianmcook Apr 6, 2021
9450584
ARROW-12217: [C++] Cleanup cpp examples source files naming
cyb70289 Apr 6, 2021
09c8eea
ARROW-12222: [Dev][Packaging] Include build url in the crossbow conso…
kszucs Apr 6, 2021
4384b65
ARROW-11548: [C++] Fix RandomArrayGenerator::List
pitrou Apr 6, 2021
09aada3
ARROW-12237: [Packaging][Debian] Add support for bullseye
kou Apr 7, 2021
5554c54
ARROW-12228: [CI] Create base image for conda environments
kszucs Apr 7, 2021
9e679a5
ARROW-12242: [Python][Doc] Tweak nightly build instructions
pitrou Apr 7, 2021
0c02ff9
ARROW-12112: [CI] Reduce footprint of conda-integration image
pitrou Apr 7, 2021
8780ca4
ARROW-11475: [C++] Upgrade mimalloc
pitrou Apr 7, 2021
f742968
ARROW-11464: [Python] Fix parquet.read_pandas to support all keywords…
jorisvandenbossche Apr 7, 2021
d95c72f
ARROW-9451: [Python] Refuse implicit cast of str to unsigned integer
pitrou Apr 7, 2021
be8892e
ARROW-12040: [C++] Fix potential deadlock in recursive S3 walks
westonpace Apr 7, 2021
ad54cfa
ARROW-12241: [Python] Make CSV cancellation test more robust
pitrou Apr 7, 2021
4db1b05
ARROW-12154: [C++][Gandiva] Fix gandiva crash in certain OS/CPU combi…
projjal Apr 7, 2021
53696db
ARROW-12168: [Go][IPC] Implement Compression handling for Arrow IPC
Apr 7, 2021
9a41ccb
ARROW-12190: [Rust][DataFusion] Implement parallel / partitioned hash…
Dandandan Apr 7, 2021
a574953
ARROW-12082: [R][Dataset] Allow create dataset from vector of file paths
ianmcook Apr 7, 2021
8a4a1b7
ARROW-12141: [R] Bindings for grepl
ianmcook Apr 7, 2021
e69478a
ARROW-12235: [Rust][DataFusion] LIMIT returns incorrect results when …
alamb Apr 7, 2021
f96ebcf
ARROW-12224: [Rust] Use stable rust for no default test, clean up CI …
alamb Apr 7, 2021
6b67b57
ARROW-11906 [R]: Make FeatherReader print method more informative
jonkeane Apr 7, 2021
aa111ff
ARROW-11832: [R] Handle conversion of extra nested struct column
romainfrancois Apr 7, 2021
6949f4a
ARROW-12249: [R] [CI] Fix test-r-install-local nightlies
jonkeane Apr 7, 2021
c92815d
ARROW-11754: [R] Support dplyr::compute()
ianmcook Apr 7, 2021
f2db785
ARROW-12283: [R] Bindings for basic type convert functions in dplyr v…
ianmcook Apr 8, 2021
c736034
ARROW-11336: [C++][Doc] Improve Developing on Windows docs
ianmcook Apr 8, 2021
a691e03
MINOR: [Doc] Update the scopes needed for GH PATs and crossbow (#9940)
jonkeane Apr 8, 2021
91802ae
MINOR: [Doc] Fix "Parametrized" typo in docs (#9931)
emkornfield Apr 8, 2021
7950d62
ARROW-12133: [C++][Gandiva] Add option to disable targeting host cpu …
projjal Apr 8, 2021
95ca4f5
ARROW-12215: [C++] Allow null values in fixed-size binary columns rea…
scober Apr 8, 2021
8d4ce14
ARROW-12286: [C++] Create AsyncGenerator from Future<AsyncGenerator<T>>
westonpace Apr 8, 2021
b24cff9
ARROW-10403: [C++] Implement unique kernel for non-uniform chunked di…
rok Apr 8, 2021
3258f12
ARROW-12058: [Python] Enable arithmetic operations on Expressions
jorisvandenbossche Apr 8, 2021
5489bbf
ARROW-10255: [JS] Reorganize exports for ESM tree-shaking
trxcllnt Apr 8, 2021
7679260
ARROW-12200: [R] Export and document list_compute_functions
nealrichardson Apr 8, 2021
9ea2890
ARROW-10421: [R] Use gc_memory_pool in more places
lidavidm Apr 8, 2021
ac38410
ARROW-12238: [JS] Remove trailing spaces and consistently add space a…
domoritz Apr 8, 2021
c2f665b
ARROW-12279: [Rust][DataFusion] Add test for null handling in hash jo…
Dandandan Apr 8, 2021
2fd4f80
ARROW-11982: [Rust] Donate Ballista Distributed Compute Platform
andygrove Apr 8, 2021
5b5c058
ARROW-12300: [C++] Remove linking of cuda runtime library
galipremsagar Apr 9, 2021
df12dee
ARROW-12209: [JS] Copy all src files into the the TypeScript package
trxcllnt Apr 9, 2021
493b205
MINOR: [JS] correct comment in build file (#9955)
domoritz Apr 9, 2021
113a515
ARROW-12239: [JS] Switch to yarn
domoritz Apr 9, 2021
dc182fd
ARROW-12305: [JS] Update generate.py to python3 and new versions of p…
domoritz Apr 9, 2021
60d1678
MINOR: [JS] remove superfluous space (#9953)
domoritz Apr 9, 2021
24da9fa
MINOR: [JS] Correct build commands (#9960)
domoritz Apr 9, 2021
6bace6e
ARROW-12170: [Rust][DataFusion] Introduce repartition optimization
Dandandan Apr 9, 2021
6ddaaa8
ARROW-12254: [Rust][DataFusion] Stop polling limit input once limit i…
alamb Apr 9, 2021
53b462b
ARROW-12278: [Rust][DataFusion] Use Timestamp(Nanosecond, None) for S…
alamb Apr 9, 2021
75c8cd6
ARROW-11328: [R] Collecting zero columns from a dataset returns entir…
Apr 9, 2021
c0ce2b1
ARROW-12098: [R] Catch cpp build failures on linux
nealrichardson Apr 9, 2021
3dc01c5
ARROW-11795: [MATLAB] Migrate MATLAB Interface for Apache Arrow desig…
lafiona Apr 9, 2021
868b543
ARROW-12208: [C++] Add the ability to run async tasks without using t…
westonpace Apr 9, 2021
a077d67
ARROW-11589: [R] Add methods for modifying Schemas
nealrichardson Apr 9, 2021
aa27b4c
ARROW-11766: [R] Better handling for missing compression codecs on Linux
Apr 9, 2021
f3d2d2d
ARROW-12263: [Dev][Packaging] Move Crossbow to Archery
kszucs Apr 9, 2021
4822483
ARROW-12197: [R] dplyr bindings for cast, dictionary_encode
ianmcook Apr 10, 2021
c5bc1a2
ARROW-12320: [CI] REPO arg missing from conda-cpp-valgrind
westonpace Apr 10, 2021
e20a8b2
ARROW-12327: [Dev] Use pull request's head remote when submitting cro…
kszucs Apr 10, 2021
604379e
ARROW-12328: [Rust] [Ballista] Fix formatting
andygrove Apr 10, 2021
19891d0
ARROW-12329: [Rust] [Ballista] Add Ballista README
andygrove Apr 10, 2021
552969f
ARROW-12267: [Rust] Implement support for timestamps in JSON writer
alamb Apr 11, 2021
13c334e
ARROW-12251: [Rust] Add Ballista to CI
andygrove Apr 11, 2021
c2dc1d9
ARROW-12281: [JS] Remove shx, trash, and rimraf and update learna for…
domoritz Apr 11, 2021
4d7150a
ARROW-12274: [JS] Document how to run tests without building bundles
domoritz Apr 11, 2021
fdb055f
MINOR: Fix link to definition of minor (#9989)
domoritz Apr 11, 2021
f1f4f2b
Tag Ballista PRs (#9986)
andygrove Apr 11, 2021
5b08205
ARROW-12313: [Rust] [Ballista] Update benchmark docs for Ballista
andygrove Apr 11, 2021
077fe64
ARROW-12299: [Python] Recognize new filesytems in pq.write_to_dataset
jorisvandenbossche Apr 12, 2021
af4c8fc
ARROW-12140: [C++][CI] Fix Valgrind failures in Grouper tests
pitrou Apr 12, 2021
6e61023
ARROW-11984: [C++][Gandiva] Implement SHA1 and SHA256 functions
anthonylouisbsb Apr 12, 2021
632b2c1
ARROW-12069: [C++][Gandiva] Implement IN expressions for Decimal type
jvictorhuguenin Apr 12, 2021
b43a689
ARROW-12146: [C++][Gandiva] Implement CONVERT_FROM(expression, replac…
jpedroantunes Apr 12, 2021
91a658a
ARROW-12323: [C++][Gandiva] Implement castTIME(timestamp) function
anthonylouisbsb Apr 12, 2021
66e1d2b
ARROW-11972: [C++][R][Python][Dataset] Extract IPC/Parquet fragment s…
lidavidm Apr 12, 2021
745cdb6
ARROW-12314: [Python] Accept columns as set in parquet read_pandas
jorisvandenbossche Apr 12, 2021
4c6e1b9
ARROW-11761: [C++] Increase public API testing
pitrou Apr 12, 2021
4801e43
ARROW-6248: [Python][C++] Raise better exception on HDFS file open error
pitrou Apr 12, 2021
b385fcd
ARROW-12333: [JS] Remove jest-environment-node-debug and do not emit …
domoritz Apr 12, 2021
f0bdcdb
ARROW-12303: [JS] Use iterator instead of yield
domoritz Apr 12, 2021
bc5a63b
ARROW-12236: [R][CI] Add check that all docs pages are listed in _pkg…
thisisnic Apr 12, 2021
c51c19e
ARROW-12342: [Packaging] Fix tabulation in crossbow templates for sub…
kszucs Apr 13, 2021
62f8c20
ARROW-12326: [C++] Avoid needless c-ares detection
kou Apr 13, 2021
2c77f3b
ARROW-12352: [CI][R][Windows] Remove needless workaround for MSYS2
kou Apr 13, 2021
7224920
ARROW-11752: [R] Replace usage of testthat::expect_is()
thisisnic Apr 13, 2021
1ed6819
ARROW-12277: [Rust][DataFusion] Implement Sum/Count/Min/Max aggregate…
alamb Apr 13, 2021
a102ba2
ARROW-12288: [C++] Create Scanner interface
westonpace Apr 13, 2021
57d430e
ARROW-12332: [Rust] [Ballista] Add simple api server in scheduler
msathis Apr 13, 2021
a49d6e2
ARROW-12248: [C++] Avoid looking up ARROW_DEFAULT_MEMORY_POOL environ…
pitrou Apr 13, 2021
d7558bf
ARROW-11839: [C++] Use xsimd for generation of accelerated bit-unpacking
pitrou Apr 13, 2021
00a4436
ARROW-12357: [Archery] Bump Jinja2 version requirement
pitrou Apr 13, 2021
a5f3b35
ARROW-11070: [C++][Compute] Implement power kernel
rok Apr 13, 2021
5db4092
ARROW-12316: [C++] Prefer mimalloc on Apple
pitrou Apr 13, 2021
ab3a08c
ARROW-12304: [R] Update news and polish docs for 4.0
nealrichardson Apr 13, 2021
7cec72d
ARROW-12309: [JS] Make es2015 bundles the default
domoritz Apr 14, 2021
c0445d6
ARROW-12269: [JS] Move to eslint
domoritz Apr 14, 2021
9fc589f
ARROW-12354: [Packaging][RPM] Use apache.jfrog.io/artifactory/ instea…
kou Apr 14, 2021
798a9b4
ARROW-12351: [CI][Ruby] Use ruby/setup-ruby instead of actions/setup-…
kou Apr 14, 2021
2863fdd
ARROW-11924: [C++] Add streaming version of FileSystem::GetFileInfo
pitrou Apr 14, 2021
fe83dca
ARROW-12361: [Rust] [DataFusion] Allow users to override physical opt…
andygrove Apr 14, 2021
c3f55a4
ARROW-12294: [Rust] Fix boolean kleene kernels with no remainder
ch-sc Apr 14, 2021
635ae77
ARROW-12374: [CI][C++][cron] Use Ubuntu 20.04 instead of 16.04
kou Apr 14, 2021
fb892c6
ARROW-12330: [Developer] Restore values at counters column of Archery…
kiszk Apr 14, 2021
9c85e54
ARROW-12287: [C++] Create enumerating generator
westonpace Apr 14, 2021
31c8c75
ARROW-12262: [Doc] Enable S3 and Flight in docs build
pitrou Apr 14, 2021
6f2e05d
ARROW-11677: [C++][Docs] Add basic C++ datasets documentation
lidavidm Apr 14, 2021
e99e70d
ARROW-12220: [C++][CI] Thread sanitizer failure
westonpace Apr 14, 2021
cb7a624
ARROW-12379: [C++] Fix ThreadSanitizer failure in SerialExecutor
pitrou Apr 14, 2021
8f35024
ARROW-12385: [R] [CI] fix cran picking in CI
jonkeane Apr 14, 2021
a5356c3
ARROW-12353: [Packaging][deb] Rename -archive-keyring to -apt-source
kou Apr 14, 2021
a865dc9
ARROW-12375: [Release] Remove rebase post-release scripts
kszucs Apr 14, 2021
ccdbbe3
ARROW-12384: [JS] Use let/const and clean up eslint rules
domoritz Apr 14, 2021
568046a
ARROW-11475: [C++] Revert "Upgrade mimalloc"
lidavidm Apr 14, 2021
05ec438
ARROW-12161: [C++][Dataset] Revert async CSV reader in datasets
westonpace Apr 14, 2021
b5045ed
ARROW-12382: [C++] Bundle xsimd if runtime SIMD level is set
lidavidm Apr 14, 2021
894fab0
ARROW-11468: [R] Allow user to pass schema to read_json_arrow()
thisisnic Apr 14, 2021
df20489
ARROW-12370: [R] Bindings for power kernel
Apr 14, 2021
c7485b7
ARROW-11477: [R][Doc] Reorganize and improve README and vignette content
ianmcook Apr 14, 2021
5794290
ARROW-12273: [JS] [Rust] Remove coveralls
domoritz Apr 15, 2021
02cdeab
ARROW-12376: [Dev] Log traceback for unexpected exceptions in archery…
lidavidm Apr 15, 2021
d575858
ARROW-11797: [C++][Dataset] Provide batch stream Scanner methods
bkietz Apr 15, 2021
818c57c
ARROW-12381: [Packaging][Python] macOS wheels are built with wrong pa…
kszucs Apr 15, 2021
1c0641d
ARROW-12017: [R] [Documentation] Make proper developing arrow docs
jonkeane Apr 15, 2021
1e6819c
ARROW-12057: [Python] Remove direct usage of pandas' Block subclasses…
jorisvandenbossche Apr 15, 2021
15137e2
ARROW-7215: [C++][Gandiva] Implement castVARCHAR(numeric_type) functions
anthonylouisbsb Apr 15, 2021
9178c13
ARROW-12337: [Rust] add DoubleEndedIterator and ExactSizeIterator traits
ritchie46 Apr 15, 2021
61935aa
ARROW-12250: [Rust] [Parquet] Fix failing arrow_writer test
nevi-me Apr 15, 2021
645c3f9
ARROW-11472: [Python][CI] Remove temporary pin of numpy in kartothek …
jorisvandenbossche Apr 15, 2021
1251f53
ARROW-12317: [Rust] JSON writer support for time, duration and date
houqp Apr 15, 2021
95b8f99
ARROW-12397: [Rust] [DataFusion] Simplify readme example
witchard Apr 15, 2021
26a5a04
ARROW-12390: [Rust] Inline from_trusted_len_iter, try_from_trusted_le…
Dandandan Apr 15, 2021
c394a20
ARROW-12246: [CI] Sync conda recipes with upstream feedstock
xhochy Apr 15, 2021
d4ca31f
ARROW-12396: [Python][Docs] Clarify serialization/filesystem docstrin…
jorisvandenbossche Apr 15, 2021
926452b
ARROW-12188: [Docs] Switch to pydata-sphinx-theme for the main sphinx…
jorisvandenbossche Apr 15, 2021
150224c
ARROW-12151: [Docs] Add Jira component + summary conventions to the docs
jonkeane Apr 15, 2021
2da0a37
ARROW-9731: [C++][Python][R][Dataset] Implement Scanner::Head
lidavidm Apr 15, 2021
22bebf8
ARROW-11568: [C++][Compute] Rewrite mode kernel
cyb70289 Apr 15, 2021
d11aaf4
ARROW-12383: [JS] Upgrade dependencies
domoritz Apr 15, 2021
fac30e7
ARROW-12408: [R] Delete Scan()
lidavidm Apr 15, 2021
cd4df5e
ARROW-8900: [C++][Python] Expose Proxy Options as parameters for S3Fi…
sahil1105 Apr 15, 2021
1cabc80
ARROW-12401: [R] Fix guard around dataset___Scanner__TakeRows
ianmcook Apr 15, 2021
28ed8f7
ARROW-12406: [R] Fix checkbashism violation in configure
jonkeane Apr 15, 2021
04e5256
ARROW-12409: [R] Remove LazyData from DESCRIPTION
jonkeane Apr 15, 2021
04b6de6
ARROW-12405: [Packaging] Fix apt artifact patterns and artifact uploa…
kszucs Apr 15, 2021
45847e1
ARROW-12389: [R] [Docs] Add note about autocasting
jonkeane Apr 15, 2021
49455ec
ARROW-12367: [C++] Stop producing when PushGenerator was destroyed
pitrou Apr 15, 2021
ec6436e
ARROW-12402: [Rust] [DataFusion] Implement SQL metrics example
andygrove Apr 15, 2021
958c19a
ARROW-12335: [Rust] [Ballista] Use latest DataFusion
andygrove Apr 15, 2021
6888c41
ARROW-6103: [Release][Java] Remove mvn release plugin
kszucs Apr 15, 2021
d7f90ca
ARROW-12400: [Rust] Re-enable tests in arrow::array::transform
tustvold Apr 15, 2021
3e5895d
ARROW-12111: [Java] Generate flatbuffer files using flatc 1.12.0
nbauernfeind Apr 16, 2021
b2ceb8f
ARROW-12104: [Go][Parquet] Second chunk of Ported Go Parquet code
Apr 16, 2021
715cb57
ARROW-11999: [Java] Support parallel vector element search with user-…
liyafan82 Apr 16, 2021
66aa3e7
ARROW-12380: [Rust] [Ballista] Basic scheduler ui
msathis Apr 16, 2021
9c1e5bd
ARROW-12421: [Rust] [DataFusion] Disable repartition rule
andygrove Apr 16, 2021
fdd6ab1
ARROW-12429: [C++] Fix incorrectly registered test
lidavidm Apr 17, 2021
ade457f
ARROW-12419: [Java] Remove to download flatc binary for s390x
kiszk Apr 17, 2021
d4c5c3c
ARROW-12433: [Rust] Update nightly rust version
alippai Apr 17, 2021
bb53986
ARROW-12334: [Rust] [Ballista] Aggregate queries producing incorrect …
edrevo Apr 17, 2021
7ad51be
ARROW-12437: [Rust] [Ballista] Create DataFusion context without repa…
andygrove Apr 17, 2021
27c4fa2
ARROW-12421: [Rust] [DataFusion] Fix topkexec failure
Dandandan Apr 17, 2021
be8b486
ARROW-12398: [Rust] remove redundant bound check in iterators
ritchie46 Apr 18, 2021
7e3deb5
ARROW-12425: [Rust] Fix new_null_array dictionary creation
tustvold Apr 18, 2021
6c1f6cc
ARROW-12436: [Rust][Ballista] Add watch capabilities to config backen…
edrevo Apr 18, 2021
26cdb15
Ballista now uses relative paths for dependencies (#10087)
andygrove Apr 18, 2021
9a4ef46
ARROW-12432: [Rust] [DataFusion] Add metrics to SortExec
andygrove Apr 18, 2021
e2625d2
ARROW-12440: [Release][Packaging] Various packaging, release script a…
kszucs Apr 18, 2021
8eab044
ARROW-12420: [C++/Dataset] Reading null columns as dictionary not lon…
kszucs Apr 18, 2021
2d791b4
ARROW-12395: Create RunInSerialExecutor benchmark
westonpace Apr 19, 2021
1dc8f94
ARROW-7906: [C++] [Python] Add ORC write support
Apr 19, 2021
2b2eeeb
ARROW-12289: [C++] Create basic AsyncScanner implementation
westonpace Apr 19, 2021
4d3ce2b
ARROW-12423: [Docs] Remove Codecov badge
domoritz Apr 20, 2021
32e0778
MINOR: [JS] Remove Travis badge (#10105)
domoritz Apr 20, 2021
930c381
ARROW-12475: [C++] Fix 'warn_unused_result' warning
cyb70289 Apr 20, 2021
6b4a4aa
ARROW-12466: [Python] Avoid AttributeError crash when comparing with …
amol- Apr 20, 2021
893bcc2
ARROW-12477: [Release] Download aarch64 miniforge
cyb70289 Apr 21, 2021
3dd6d32
ARROW-12488: [GLib] Use g_memdup2() with GLib 2.68 or later
kou Apr 21, 2021
2ed54db
ARROW-12325: [C++] [CI] Nightly gandiva build failing due to failure …
westonpace Apr 21, 2021
c408048
ARROW-12467: [C++][Gandiva] Add support for LLVM12
kszucs Apr 21, 2021
7c4a07e
ARROW-12487: [C++][Dataset] Fix ScanBatches() hanging
lidavidm Apr 21, 2021
be258f2
ARROW-12485: [C++] Use mimalloc as the default memory allocator on macOS
jonkeane Apr 21, 2021
37c27d1
ARROW-10195: [C++] Add string struct extract kernel using re2
maartenbreddels Apr 21, 2021
c71becf
ARROW-12494: [C++] ORC adapter fails to compile on GCC 4.8
kszucs Apr 21, 2021
7a0cb10
ARROW-12495: [C++] Fix NumPyBuffer::mutable_data()
pitrou Apr 21, 2021
15b5d8e
[Release] Update CHANGELOG.md for 4.0.0
kszucs Apr 21, 2021
6d5f468
[Release] Update .deb/.rpm changelogs for 4.0.0
kszucs Apr 21, 2021
f959141
[Release] Update versions for 4.0.0
kszucs Apr 21, 2021
8004518
ARROW-12520: [R] Minor docs updates
jonkeane Apr 26, 2021
b1303cd
ARROW-12601: [R][Packaging] Fix pkg-config check in r/configure
ianmcook Apr 29, 2021
c5ee38c
ARROW-11926: [R] preparations for ucrt toolchains
jeroen Apr 30, 2021
dbf16e3
ARROW-12622: [Python] Fix segfault in read_csv when not on main thread
lidavidm May 3, 2021
f80292d
ARROW-12617: [Python] Align orc.write_table keyword order with parque…
jorisvandenbossche May 3, 2021
63954cb
ARROW-12568: [C++][Compute] Fix nullptr deference when array contains…
cyb70289 May 3, 2021
be5f704
ARROW-12571: [R][CI] Run nightly R with valgrind
jonkeane May 5, 2021
a891241
ARROW-12642: [R] LIBARROW_MINIMAL, LIBARROW_DOWNLOAD, NOT_CRAN env va…
ianmcook May 5, 2021
6a2915e
ARROW-12663: [C++] Fix a cuda 11.2 compiler segfault
galipremsagar May 10, 2021
af476c6
ARROW-12670: [C++] Fix extract_regex output after non-matching values
pitrou May 11, 2021
39dd06b
ARROW-12746: [Go][Flight] append instead of overwriting outgoing meta…
May 13, 2021
efdfb6d
ARROW-12774 : [C++][Compute] replace_substring_regex() creates invali…
nirandaperera May 15, 2021
eaac875
ARROW-12578: [JS] Remove Buffer in favor of TextEncoder API to suppor…
domoritz May 15, 2021
7e2fdda
ARROW-12776: [Archery][Integration] Fix decimal case generation in wr…
kszucs May 14, 2021
cd0a083
ARROW-12769: [Python] Fix slicing array with "negative" length (start…
jorisvandenbossche May 17, 2021
577863f
ARROW-12806: [Python] test_write_to_dataset_filesystem missing a data…
jorisvandenbossche May 17, 2021
63da155
ARROW-12619: [Python] pyarrow sdist should not require git
kszucs May 17, 2021
d1ca32e
ARROW-12603: [C++][Dataset] Backport fix for specifying CSV column ty…
lidavidm May 17, 2021
7dcc554
[Release] Update CHANGELOG.md for 4.0.1
jorgecarleitao May 19, 2021
126715f
[Release] Update .deb/.rpm changelogs for 4.0.1
jorgecarleitao May 19, 2021
81ff679
[Release] Update versions for 4.0.1
jorgecarleitao May 19, 2021
29f8fe8
Merge tag 'apache-arrow-4.0.1' of github.com:apache/arrow into update
Jun 1, 2021
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
The diff you're trying to view is too large. We only load the first 3000 changed files.
42 changes: 34 additions & 8 deletions .env
Original file line number Diff line number Diff line change
Expand Up @@ -18,31 +18,57 @@
# All of the following environment variables are required to set default values
# for the parameters in docker-compose.yml.

# empty prefix means that the docker-compose configuration will use named
# volumes which potentially improves the performance on docker for macos and
# docker for windows, it also prevents the contamination of the source
# directory
# a non-empty prefix means that directories from the host are bind-mounted
# into the container, it should be set to ".docker/" on github actions to keep
# the cache plugin functional
DOCKER_VOLUME_PREFIX=

# turn on inline build cache, this is a docker buildx feature documented
# at https://github.com/docker/buildx#--cache-tonametypetypekeyvalue
COMPOSE_DOCKER_CLI_BUILD=1
DOCKER_BUILDKIT=1
BUILDKIT_INLINE_CACHE=1

# different architecture notations
ARCH=amd64
ARCH_ALIAS=x86_64
ARCH_SHORT_ALIAS=x64

ULIMIT_CORE=-1
REPO=apache/arrow-dev
ARCH=amd64
CUDA=9.1
DEBIAN=10
UBUNTU=18.04
FEDORA=32
UBUNTU=20.04
FEDORA=33
PYTHON=3.6
LLVM=10
LLVM=12
CLANG_TOOLS=8
RUST=nightly-2020-04-22
GO=1.12
RUST=nightly-2021-03-24
GO=1.15
NODE=14
MAVEN=3.5.4
JDK=8
NUMPY=latest
PANDAS=latest
DASK=latest
TURBODBC=latest
KARTOTHEK=latest
HDFS=2.9.2
HDFS=3.2.1
SPARK=master
DOTNET=2.1
DOTNET=3.1
R=4.0
ARROW_R_DEV=TRUE
# These correspond to images on Docker Hub that contain R, e.g. rhub/ubuntu-gcc-release:latest
R_ORG=rhub
R_IMAGE=ubuntu-gcc-release
R_TAG=latest
# -1 does not attempt to install a devtoolset version, any positive integer will install devtoolset-n
DEVTOOLSET_VERSION=-1

# Used for the manylinux and windows wheels, please update the crossbow configuration on update:
# https://github.com/ursacomputing/crossbow/blob/master/.github/workflows/cache_vcpkg.yml
VCPKG=fced4bef1606260f110d74de1ae1975c2b9ac549
9 changes: 4 additions & 5 deletions .github/workflows/archery.yml
Original file line number Diff line number Diff line change
Expand Up @@ -43,23 +43,22 @@ jobs:
with:
fetch-depth: 0
- name: Git Fixup
if: ${{ github.event_name == 'pull_request' }}
if: ${{ github.ref != 'master' }}
shell: bash
run: git branch master origin/master
- name: Free Up Disk Space
run: ci/scripts/util_cleanup.sh
- name: Setup Python
uses: actions/setup-python@v1
with:
python-version: '3.5'
python-version: '3.6'
- name: Install Archery, Crossbow- and Test Dependencies
working-directory: dev/archery
run: pip install pytest responses toolz jinja2 -e .[all]
run: pip install pytest responses -e dev/archery[all]
- name: Archery Unittests
working-directory: dev/archery
run: pytest -v archery
- name: Archery Docker Validation
run: archery docker
- name: Crossbow Check Config
working-directory: dev/tasks
run: python crossbow.py check-config
run: archery crossbow check-config
123 changes: 123 additions & 0 deletions .github/workflows/cancel.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,123 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.

name: Cancel stale runs

on:
workflow_run:
# The name of another workflow (whichever one) that always runs on PRs
workflows: ['Dev']
types: ['requested']

jobs:
cancel-stale-workflow-runs:
name: "Cancel stale workflow runs"
runs-on: ubuntu-latest
steps:
# Unfortunately, we need to define a separate cancellation step for
# each workflow where we want to cancel stale runs.
- uses: potiuk/cancel-workflow-runs@master
name: "Cancel stale C++ runs"
with:
cancelMode: allDuplicates
token: ${{ secrets.GITHUB_TOKEN }}
workflowFileName: cpp.yml
skipEventTypes: '["push", "schedule"]'
- uses: potiuk/cancel-workflow-runs@master
name: "Cancel stale C# runs"
with:
cancelMode: allDuplicates
token: ${{ secrets.GITHUB_TOKEN }}
workflowFileName: csharp.yml
skipEventTypes: '["push", "schedule"]'
- uses: potiuk/cancel-workflow-runs@master
name: "Cancel stale Dev runs"
with:
cancelMode: allDuplicates
token: ${{ secrets.GITHUB_TOKEN }}
workflowFileName: dev.yml
skipEventTypes: '["push", "schedule"]'
- uses: potiuk/cancel-workflow-runs@master
name: "Cancel stale Go runs"
with:
cancelMode: allDuplicates
token: ${{ secrets.GITHUB_TOKEN }}
workflowFileName: go.yml
skipEventTypes: '["push", "schedule"]'
- uses: potiuk/cancel-workflow-runs@master
name: "Cancel stale Integration runs"
with:
cancelMode: allDuplicates
token: ${{ secrets.GITHUB_TOKEN }}
workflowFileName: integration.yml
skipEventTypes: '["push", "schedule"]'
- uses: potiuk/cancel-workflow-runs@master
name: "Cancel stale Java JNI runs"
with:
cancelMode: allDuplicates
token: ${{ secrets.GITHUB_TOKEN }}
workflowFileName: java_jni.yml
skipEventTypes: '["push", "schedule"]'
- uses: potiuk/cancel-workflow-runs@master
name: "Cancel stale Java runs"
with:
cancelMode: allDuplicates
token: ${{ secrets.GITHUB_TOKEN }}
workflowFileName: java.yml
skipEventTypes: '["push", "schedule"]'
- uses: potiuk/cancel-workflow-runs@master
name: "Cancel stale JS runs"
with:
cancelMode: allDuplicates
token: ${{ secrets.GITHUB_TOKEN }}
workflowFileName: js.yml
skipEventTypes: '["push", "schedule"]'
- uses: potiuk/cancel-workflow-runs@master
name: "Cancel stale Julia runs"
with:
cancelMode: allDuplicates
token: ${{ secrets.GITHUB_TOKEN }}
workflowFileName: julia.yml
skipEventTypes: '["push", "schedule"]'
- uses: potiuk/cancel-workflow-runs@master
name: "Cancel stale Python runs"
with:
cancelMode: allDuplicates
token: ${{ secrets.GITHUB_TOKEN }}
workflowFileName: python.yml
skipEventTypes: '["push", "schedule"]'
- uses: potiuk/cancel-workflow-runs@master
name: "Cancel stale R runs"
with:
cancelMode: allDuplicates
token: ${{ secrets.GITHUB_TOKEN }}
workflowFileName: r.yml
skipEventTypes: '["push", "schedule"]'
- uses: potiuk/cancel-workflow-runs@master
name: "Cancel stale Ruby runs"
with:
cancelMode: allDuplicates
token: ${{ secrets.GITHUB_TOKEN }}
workflowFileName: ruby.yml
skipEventTypes: '["push", "schedule"]'
- uses: potiuk/cancel-workflow-runs@master
name: "Cancel stale Rust runs"
with:
cancelMode: allDuplicates
token: ${{ secrets.GITHUB_TOKEN }}
workflowFileName: rust.yml
skipEventTypes: '["push", "schedule"]'
23 changes: 9 additions & 14 deletions .github/workflows/comment_bot.yml
Original file line number Diff line number Diff line change
Expand Up @@ -34,17 +34,12 @@ jobs:
uses: actions/checkout@v2
with:
path: arrow
# because libgit2 is a dependency of crossbow so prefer conda
- name: Setup Conda
uses: s-weigand/setup-conda@v1
- name: Set up Python
uses: actions/setup-python@v2
with:
update-conda: true
python-version: 3.7
conda-channels: conda-forge
- name: Instal Archery and Crossbow dependencies
run: |
conda install -y --file arrow/ci/conda_env_crossbow.txt pygithub
pip install -e arrow/dev/archery[bot]
python-version: 3.8
- name: Install Archery and Crossbow dependencies
run: pip install -e arrow/dev/archery[bot]
- name: Handle Github comment event
env:
ARROW_GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
Expand Down Expand Up @@ -72,16 +67,16 @@ jobs:
git diff --name-only HEAD..upstream/master | grep -e "$1" >/dev/null 2>&1
}
if changed '^r/.*\.R$'; then
echo "::set-env name=R_DOCS::true"
echo "R_DOCS=true" >> $GITHUB_ENV
fi
if changed 'cmake' || changed 'CMake'; then
echo "::set-env name=CMAKE_FORMAT::true"
echo "CMAKE_FORMAT=true" >> $GITHUB_ENV
fi
if changed '^cpp/src'; then
echo "::set-env name=CLANG_FORMAT_CPP::true"
echo "CLANG_FORMAT_CPP=true" >> $GITHUB_ENV
fi
if changed '^r/src'; then
echo "::set-env name=CLANG_FORMAT_R::true"
echo "CLANG_FORMAT_R=true" >> $GITHUB_ENV
fi
- name: Run cmake_format
if: env.CMAKE_FORMAT == 'true' || endsWith(github.event.comment.body, 'everything')
Expand Down
12 changes: 6 additions & 6 deletions .github/workflows/cpp.yml
Original file line number Diff line number Diff line change
Expand Up @@ -38,9 +38,8 @@ on:
- 'format/Flight.proto'

env:
DOCKER_BUILDKIT: 0
COMPOSE_DOCKER_CLI_BUILD: 1
ARROW_ENABLE_TIMING_TESTS: OFF
DOCKER_VOLUME_PREFIX: ".docker/"
ARCHERY_DOCKER_USER: ${{ secrets.DOCKERHUB_USER }}
ARCHERY_DOCKER_PASSWORD: ${{ secrets.DOCKERHUB_TOKEN }}

Expand All @@ -60,7 +59,7 @@ jobs:
- image: conda-cpp
title: AMD64 Conda C++
- image: ubuntu-cpp-sanitizer
title: AMD64 Ubuntu 18.04 C++ ASAN UBSAN
title: AMD64 Ubuntu 20.04 C++ ASAN UBSAN
steps:
- name: Checkout Arrow
uses: actions/checkout@v2
Expand All @@ -71,7 +70,7 @@ jobs:
- name: Free Up Disk Space
run: ci/scripts/util_cleanup.sh
- name: Cache Docker Volumes
uses: actions/cache@v1
uses: actions/cache@v2
with:
path: .docker
key: ${{ matrix.image }}-${{ hashFiles('cpp/**') }}
Expand Down Expand Up @@ -218,7 +217,9 @@ jobs:
- name: Install Dependencies
shell: bash
run: |
rm -f /usr/local/bin/2to3
brew update --preinstall
brew unlink gcc@8 gcc@9
brew bundle --file=cpp/Brewfile
- name: Build
shell: bash
Expand Down Expand Up @@ -338,7 +339,6 @@ jobs:
CMAKE_ARGS: >-
-DARROW_PACKAGE_PREFIX=/mingw${{ matrix.mingw-n-bits }}
-DBoost_NO_BOOST_CMAKE=ON
CMAKE_GENERATOR: MSYS Makefiles
CMAKE_UNITY_BUILD: ON
steps:
- name: Disable Crash Dialogs
Expand All @@ -365,7 +365,7 @@ jobs:
run: |
ci/scripts/msys2_setup.sh cpp
- name: Cache ccache
uses: actions/cache@v1
uses: actions/cache@v2
with:
path: ccache
key: cpp-ccache-mingw${{ matrix.mingw-n-bits }}-${{ hashFiles('cpp/**') }}
Expand Down
30 changes: 12 additions & 18 deletions .github/workflows/cpp_cron.yml
Original file line number Diff line number Diff line change
Expand Up @@ -30,9 +30,8 @@ on:
0 */12 * * *

env:
DOCKER_BUILDKIT: 0
COMPOSE_DOCKER_CLI_BUILD: 1
ARROW_ENABLE_TIMING_TESTS: OFF
DOCKER_VOLUME_PREFIX: ".docker/"
ARCHERY_DOCKER_USER: ${{ secrets.DOCKERHUB_USER }}
ARCHERY_DOCKER_PASSWORD: ${{ secrets.DOCKERHUB_TOKEN }}

Expand All @@ -46,36 +45,31 @@ jobs:
matrix:
name:
- amd64-debian-10-cpp
- amd64-fedora-32-cpp
- amd64-ubuntu-16.04-cpp
- amd64-fedora-33-cpp
- amd64-ubuntu-18.04-cpp
- amd64-ubuntu-18.04-cpp-cmake32
- amd64-ubuntu-20.04-cpp
include:
- name: amd64-debian-10-cpp
image: debian-cpp
title: AMD64 Debian 10 C++
debian: 10
- name: amd64-fedora-32-cpp
- name: amd64-fedora-33-cpp
image: fedora-cpp
title: AMD64 Fedora 32 C++
fedora: 32
- name: amd64-ubuntu-16.04-cpp
image: ubuntu-cpp
title: AMD64 Ubuntu 16.04 C++
ubuntu: 16.04
title: AMD64 Fedora 33 C++
fedora: 33
- name: amd64-ubuntu-18.04-cpp
image: ubuntu-cpp
title: AMD64 Ubuntu 18.04 C++
ubuntu: 18.04
- name: amd64-ubuntu-18.04-cpp-cmake32
image: ubuntu-cpp-cmake32
title: AMD64 Ubuntu 18.04 C++ CMake 3.2
ubuntu: 18.04
- name: amd64-ubuntu-20.04-cpp
image: ubuntu-cpp
title: AMD64 Ubuntu 20.04 C++
ubuntu: 20.04
env:
# the defaults here should correspond to the values in .env
ARCH: 'amd64'
DEBIAN: ${{ matrix.debian || 10 }}
FEDORA: ${{ matrix.fedora || 32 }}
FEDORA: ${{ matrix.fedora || 33 }}
UBUNTU: ${{ matrix.ubuntu || 18.04 }}
steps:
- name: Checkout Arrow
Expand All @@ -87,7 +81,7 @@ jobs:
- name: Free Up Disk Space
run: ci/scripts/util_cleanup.sh
- name: Cache Docker Volumes
uses: actions/cache@v1
uses: actions/cache@v2
with:
path: .docker
key: ${{ matrix.name }}-${{ hashFiles('cpp/**') }}
Expand Down
Loading