[Python] take function doesn't work when table has large row counts #31249

@asfimport

Description

Steps to reproduce the error:

 

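import pyarrow as pa

# 260,000,000 strings x 9 bytes = 2,340,000,000 bytes of character data,
# which exceeds the 2**31 - 1 bytes addressable by 32-bit string offsets.
a = pa.array(["123456789"] * 260000000)
tb = pa.table([a], names=["a"])
tb.take([1])  # raises ArrowInvalid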

 

Error message:
ArrowInvalid: offset overflow while concatenating arrays

Environment: pyarrow 7.0.0, pyarrow 5.0
Reporter: Zhe Wu

Note: This issue was originally created as ARROW-15808. Please see the migration documentation for further details.
