@norberttech
Member

Change Log

Added

  • DataFrame::duplicateRow() method
  • with_entry() function

Fixed

Changed

  • DataFrame::with(), DataFrame::transform(), and DataFrame::withEntries() now also accept WithEntry instances

Removed

Deprecated

Security


Description

Resolves #1539

@norberttech linked an issue Mar 20, 2025 that may be closed by this pull request
@github-actions
Contributor

github-actions bot commented Mar 20, 2025

Flow PHP - Benchmarks

Results of the benchmarks from this PR are compared with the results from 1.x branch.

Extractors
+-----------------------+-------------------+------+-----+-----------------+------------------+-----------------+
| benchmark             | subject           | revs | its | mem_peak        | mode             | rstdev          |
+-----------------------+-------------------+------+-----+-----------------+------------------+-----------------+
| CSVExtractorBench     | bench_extract_10k | 1    | 3   | 4.868mb +0.17%  | 620.559ms -0.20% | ±2.22% +27.91%  |
| JsonExtractorBench    | bench_extract_10k | 1    | 3   | 4.942mb +0.18%  | 1.126s +0.64%    | ±0.39% -9.80%   |
| ParquetExtractorBench | bench_extract_10k | 1    | 3   | 86.461mb +0.00% | 899.079ms -1.77% | ±1.96% +288.93% |
| TextExtractorBench    | bench_extract_10k | 1    | 3   | 4.595mb +0.12%  | 37.986ms -1.22%  | ±0.68% +136.47% |
| XmlExtractorBench     | bench_extract_10k | 1    | 3   | 4.569mb +0.13%  | 612.003ms +1.54% | ±1.66% -44.49%  |
+-----------------------+-------------------+------+-----+-----------------+------------------+-----------------+
Transformers
+-----------------------------+--------------------------+------+-----+------------------+-----------------+----------------+
| benchmark                   | subject                  | revs | its | mem_peak         | mode            | rstdev         |
+-----------------------------+--------------------------+------+-----+------------------+-----------------+----------------+
| RenameEntryTransformerBench | bench_transform_10k_rows | 1    | 3   | 127.391mb +0.01% | 70.944ms -0.34% | ±1.28% +37.20% |
+-----------------------------+--------------------------+------+-----+------------------+-----------------+----------------+
Loaders
+--------------------+----------------+------+-----+------------------+------------------+----------------+
| benchmark          | subject        | revs | its | mem_peak         | mode             | rstdev         |
+--------------------+----------------+------+-----+------------------+------------------+----------------+
| CSVLoaderBench     | bench_load_10k | 1    | 3   | 64.036mb +0.01%  | 104.896ms -1.93% | ±1.08% -3.78%  |
| JsonLoaderBench    | bench_load_10k | 1    | 3   | 84.416mb +0.01%  | 97.598ms -2.07%  | ±0.72% +41.29% |
| ParquetLoaderBench | bench_load_10k | 1    | 3   | 161.266mb +0.01% | 20.512s -1.92%   | ±0.10% -80.09% |
| TextLoaderBench    | bench_load_10k | 1    | 3   | 18.127mb +0.03%  | 31.200ms -1.32%  | ±0.55% +4.12%  |
+--------------------+----------------+------+-----+------------------+------------------+----------------+
Building Blocks
+-------------------+----------------------------+------+-----+------------------+------------------+-----------------+
| benchmark         | subject                    | revs | its | mem_peak         | mode             | rstdev          |
+-------------------+----------------------------+------+-----+------------------+------------------+-----------------+
| TypeDetectorBench | bench_type_detector        | 1    | 3   | 43.912mb +0.01%  | 363.873ms -1.32% | ±0.11% -81.88%  |
| TypeDetectorBench | bench_type_detector        | 1    | 3   | 11.722mb +0.02%  | 72.723ms -0.71%  | ±0.72% +2.06%   |
| EntryFactoryBench | bench_entry_factory        | 1    | 3   | 106.017mb +0.01% | 513.833ms -2.36% | ±0.14% -78.52%  |
| EntryFactoryBench | bench_entry_factory        | 1    | 3   | 55.208mb +0.02%  | 263.089ms -0.85% | ±1.52% +714.30% |
| EntryFactoryBench | bench_entry_factory        | 1    | 3   | 14.730mb +0.07%  | 56.262ms -3.00%  | ±0.56% -43.90%  |
| RowsBench         | bench_chunk_10_on_10k      | 2    | 3   | 97.054mb +0.01%  | 3.263ms -18.05%  | ±1.95% +11.61%  |
| RowsBench         | bench_diff_left_1k_on_10k  | 2    | 3   | 114.344mb +0.06% | 184.844ms -0.62% | ±0.31% -77.45%  |
| RowsBench         | bench_diff_right_1k_on_10k | 2    | 3   | 97.064mb +0.07%  | 18.496ms -0.68%  | ±1.13% +315.99% |
| RowsBench         | bench_drop_1k_on_10k       | 2    | 3   | 97.929mb +0.01%  | 1.524ms -26.35%  | ±0.84% +242.45% |
| RowsBench         | bench_drop_right_1k_on_10k | 2    | 3   | 97.929mb +0.01%  | 1.548ms -19.56%  | ±2.33% +65.47%  |
| RowsBench         | bench_entries_on_10k       | 2    | 3   | 96.089mb +0.01%  | 4.648ms -11.49%  | ±2.68% +10.13%  |
| RowsBench         | bench_filter_on_10k        | 2    | 3   | 96.618mb +0.01%  | 15.933ms -3.95%  | ±0.78% -19.92%  |
| RowsBench         | bench_find_on_10k          | 2    | 3   | 96.618mb +0.01%  | 15.794ms -5.61%  | ±0.87% +15.74%  |
| RowsBench         | bench_find_one_on_10k      | 10   | 3   | 95.310mb +0.01%  | 1.900μs -5.00%   | ±0.00% +0.00%   |
| RowsBench         | bench_first_on_10k         | 10   | 3   | 95.310mb +0.01%  | 0.400μs -20.00%  | ±0.00% +0.00%   |
| RowsBench         | bench_flat_map_on_1k       | 2    | 3   | 104.528mb +0.01% | 14.954ms -13.00% | ±0.49% -80.87%  |
| RowsBench         | bench_map_on_10k           | 2    | 3   | 134.595mb +0.01% | 74.275ms -4.57%  | ±1.55% -48.84%  |
| RowsBench         | bench_merge_1k_on_10k      | 2    | 3   | 97.138mb +0.01%  | 1.342ms -38.13%  | ±2.00% +45.83%  |
| RowsBench         | bench_partition_by_on_10k  | 2    | 3   | 100.509mb +0.01% | 63.968ms -1.33%  | ±1.11% +506.75% |
| RowsBench         | bench_remove_on_10k        | 2    | 3   | 98.191mb +0.01%  | 3.785ms -13.69%  | ±1.99% +10.67%  |
| RowsBench         | bench_sort_asc_on_1k       | 2    | 3   | 95.606mb +0.01%  | 41.506ms -7.11%  | ±1.99% -30.85%  |
| RowsBench         | bench_sort_by_on_1k        | 2    | 3   | 95.606mb +0.01%  | 41.143ms -2.56%  | ±0.40% -71.17%  |
| RowsBench         | bench_sort_desc_on_1k      | 2    | 3   | 95.606mb +0.01%  | 41.431ms -4.94%  | ±0.68% -59.75%  |
| RowsBench         | bench_sort_entries_on_1k   | 2    | 3   | 97.750mb +0.01%  | 8.173ms -5.89%   | ±1.09% -17.93%  |
| RowsBench         | bench_sort_on_1k           | 2    | 3   | 95.500mb +0.01%  | 29.160ms -2.89%  | ±2.26% +4.09%   |
| RowsBench         | bench_take_1k_on_10k       | 10   | 3   | 95.310mb +0.01%  | 14.812μs +1.50%  | ±1.79% -14.51%  |
| RowsBench         | bench_take_right_1k_on_10k | 10   | 3   | 95.310mb +0.01%  | 15.394μs -10.09% | ±0.31% -58.02%  |
| RowsBench         | bench_unique_on_1k         | 2    | 3   | 114.410mb +0.01% | 188.864ms +0.74% | ±0.27% -88.34%  |
+-------------------+----------------------------+------+-----+------------------+------------------+-----------------+

@codecov

codecov bot commented Mar 20, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 83.10%. Comparing base (5adf480) to head (0b186c9).
Report is 2 commits behind head on 1.x.

Additional details and impacted files
@@            Coverage Diff             @@
##              1.x    #1541      +/-   ##
==========================================
+ Coverage   83.01%   83.10%   +0.08%     
==========================================
  Files         687      689       +2     
  Lines       18715    18778      +63     
==========================================
+ Hits        15537    15605      +68     
+ Misses       3178     3173       -5     
+-----------------------------+------------------------------+
| Components                  | Coverage Δ                   |
+-----------------------------+------------------------------+
| etl                         | 86.20% <100.00%> (+0.16%) ⬆️ |
| cli                         | 84.35% <ø> (ø)               |
| lib-array-dot               | 94.53% <ø> (ø)               |
| lib-azure-sdk               | 62.56% <ø> (ø)               |
| lib-doctrine-dbal-bulk      | 90.11% <ø> (ø)               |
| lib-filesystem              | 78.02% <ø> (ø)               |
| lib-parquet                 | 84.33% <ø> (ø)               |
| lib-parquet-viewer          | 82.02% <ø> (ø)               |
| lib-snappy                  | 91.16% <ø> (+0.46%) ⬆️       |
| bridge-filesystem-async-aws | 90.38% <ø> (ø)               |
| bridge-filesystem-azure     | 89.92% <ø> (ø)               |
| bridge-monolog-http         | 96.38% <ø> (ø)               |
| symfony-http-foundation     | 74.41% <ø> (ø)               |
| adapter-chartjs             | 86.45% <ø> (ø)               |
| adapter-csv                 | 89.57% <ø> (ø)               |
| adapter-doctrine            | 89.14% <ø> (ø)               |
| adapter-elasticsearch       | 97.19% <ø> (ø)               |
| adapter-google-sheet        | 78.04% <ø> (ø)               |
| adapter-http                | 59.15% <ø> (ø)               |
| adapter-json                | 90.62% <ø> (ø)               |
| adapter-logger              | 53.84% <ø> (ø)               |
| adapter-meilisearch         | 97.75% <ø> (ø)               |
| adapter-parquet             | 80.85% <ø> (ø)               |
| adapter-text                | 84.44% <ø> (ø)               |
| adapter-xml                 | 83.15% <ø> (ø)               |
+-----------------------------+------------------------------+

@norberttech merged commit 19106a4 into flow-php:1.x Mar 20, 2025
23 checks passed

Development

Successfully merging this pull request may close these issues.

Duplicate Row

2 participants