Jemalloc performance on 64-bit ARM

I've just run the `binary_trees` benchmark on an `ARMv8`, Cortex-A53 processor, having converted an Android TV box to Linux. 

I'd found previously, on a much weaker (but more power efficient) `armv7` Cortex A5, the results were equal. On the new machine (using the latest official `aarch64` rustc nightly) `./binary_trees 23` produces the following results:

`sysalloc` **1m28s 5m10s 0m10s**
`jemalloc` **1m35s 5m10s 0m53s**

which is palpably worse actually, even though Cortex-A53 is a much stronger core.

I'm beginning to think `jemalloc` only makes sense on Intel processors with heaps or L1/L2 cache.

More benchmark ideas welcome, though.

**added retroactively:**
To reproduce, unpack the [attachment](https://github.com/rust-lang/rust/files/333632/binary_trees_benchmark.zip) and run:

``` shell
cargo build --release && time target/release/binary_trees 23
```

 inside the binary_trees directory. Uncomment the first 2 lines in main.rs to produce a sysalloc version.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Jemalloc performance on 64-bit ARM #34476

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Jemalloc performance on 64-bit ARM #34476

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions