Skip to content

Releases: getdozer/dozer

v0.1.13

24 Mar 11:23
71bfb1f
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.1.12...v0.1.13

v0.1.12

17 Mar 11:24
7075dcb
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.1.11...v0.1.12

v0.1.11

10 Mar 13:18
9029376
Compare
Choose a tag to compare

New Features

Bug Fixes

  • fix: Use same progress bar style (expect for color) for source and sink by @chubei in #1100
  • fix: updated download_binary by @chloeminkyung in #1107
  • fix: Sink should only be built after all source checkpoints are checked by @chubei in #1112
  • fix: Blue cache was counted with limit. Green cache didn't get picked up. by @chubei in #1123
  • fix: Define unique temp folder for aggregation tests by @mediuminvader in #1124
  • fix: groupby without measures were handled wrongly by @snork-alt in #1127
  • fix: add postgres columns validations and connector test by @hoangnh93 in #1035
  • fix: Ingestion via Arrow by @v3g42 in #1141
  • fix: load_multi_sources test was not implemented correctly by @chubei in #1146
  • fix: Remove table name property usage in deltalake connector by @karolisg in #1155
  • fix: Fix compilation error introduced in #1158 by @chubei in #1183
  • fix: Add sources names used in the query by @mediuminvader in #1190
  • fix: PK was not automatically when GROUP BY is used by @snork-alt in #1193

New Contributors

Full Changelog: v0.1.10...v0.1.11

v0.1.10

28 Feb 16:56
bd0f1ae
Compare
Choose a tag to compare

New Features

  • Blue/Green Cache swap
  • Timestamp difference
  • Improved transaction performance
  • Introduce DAY_OF_WEEK() date function
  • Extend SQL type support for math operations
  • Show sources ingestion progress
  • Parallelized ingestion of Postgres snapshot data

Bug Fixes

  • Fix timestamp, point, decimal support for grpc
  • Fix use of wildcard in select
  • Fix convert from jsonb to dozer type
  • Fix check of supported tables in postgres
  • Clean dozer on start because blue/green cache is not switched correctly
  • Fix columns order in schema
  • Primary key field conversion from REST url
  • Sort schemas by sources order in dozer config

Full Changelog: v0.1.9...v0.1.10

v0.1.9

27 Feb 07:46
dbc5af9
Compare
Choose a tag to compare

Bug Fixes

  • Fix the replication slot check
  • Remove non-existent proto include
  • Fix insert operation in Record Readers

Full Changelog: v0.1.8...v0.1.9

v0.1.8

24 Feb 09:45
c9b970a
Compare
Choose a tag to compare

New Features

  • Support UNION and UNION all Set operations
  • Support Geographical Point type and Distance
  • Return Record id and version in REST APIs, gRPC endpoints and gRPC events
  • Support decimal and binary for gRPC
  • Support $after expression in pagination
  • Specify LmdbCacheManager and endpoint as an alias
  • Generating the Graph of the Pipeline
  • Expose Pipeline configuration

Bug Fixes

  • Datafusion connector compilation
  • Support mathematical operations for UINT
  • TypedService didn't filter events based on endpoint
  • Fix ls command of the Object Storage connector
  • Add missing types.proto reference
  • Use correct path builder for local storage
  • Fix eth-traces e2e test
  • Health check should not return NotServing when dynamic is false
  • Config in e2e test flags_dynamic_false
  • Fix release.yml identation
  • Add epsilon to distance test case
  • Don't create default connector config
  • Remove leaked in eth_union e2e case
  • Eth blocks parse fails
  • Remove RwCache::insert_schema so we don't read the db on every insert
  • Invalid column error
  • Fix snowflake table name validation
  • Fix snowflake schema helper import

New Contributors

Full Changelog: v0.1.7...v0.1.8

v0.1.7

11 Feb 06:49
83c4e82
Compare
Choose a tag to compare

New Features

  • New DataFusion Connector
  • Simplified Dag interface
  • All nodes use the same instant for commit timeout
  • SQL transformations are common across sources
  • LmdbRoCache and LmdbRwCache implement Send and Sync
  • Stateful pipeline for Sources without Primary Key
  • Separation of Dag execution and Dag construction

Bug fixes

  • Fix record delete when duplicated words are in string field
  • Fix usage of decimal with trailing zeros in secondary index
  • Fix e2e tests and hanging epoch manager test
  • Rephrase error when primary key is missing
  • buildkite integration test for eth e2e case

Full Changelog: v0.1.6...v0.1.7

v0.1.6

03 Feb 05:09
9a39b05
Compare
Choose a tag to compare

New Features

  • Support Wildcard expression in Select clause
  • Expression parsing support nested aggregations
  • Implement Continue Ingestion and Stream creation in Snowflake
  • Implement direct insert to cache pipeline
  • Support Multi-way JOIN
  • Improved DAG structure with daggy
  • Atomatically trim record history in RecordWriter
  • Only persist checkpoint states of relevant sources.
  • Add sql_date support to snowflake integration
  • adding commit SHA for ECR image tag for release
  • Change app config to support source name in api endpoints

Bug fixes

  • Mapping of sqlite data type to dozer data type
  • CONCAT() to support multiple arguments
  • Unify typed service query and on_event record representation
  • Implement record versioning in RecordWriter and RecordReader
  • Fix minor snowflake connector issues
  • Continue REPL execution after unknown command
  • Forbid duplicated cte names
  • Insert after delete problem in aggregation
  • Use context to solve alias properly
  • Fix snowflake connector
  • Unable to clean untagged images
  • Fix source with empty columns list validation
  • serde_str is needed to bincode deserialize Decimal
  • Don't commit when no source produces new op
  • Fix stream exist check for tables
  • Architecture diagram
  • Fix snowflake stream consumption
  • Ethereum end to end test
  • ApiEndpoint is not serializing all fields
  • Integration tests required dozer-tests image when using local runner
  • Clear PK from Projection output

Full Changelog: v0.1.5...v0.1.6

v0.1.5

20 Jan 08:34
7b824f4
Compare
Choose a tag to compare

New Features

Support CAST operator
Support UInt aggregation
Implement continuous ingestion for Ethereum
Implement nested queries and CTE
Support bpchar column type from Postgres

Bug fixes

Fixed Ethereum E2E test
Update primary key mismatch error message
Sequence number reset if no data is sent from source by
Count APIs default to no limit
Fixed on_event behavior
Fixed Discord notification block
Dozer-api test utils generating inconsistent schema and record
Schema id was not copied to records before inserting
Allow text fields being used in full text index
Generate full text index for String fields
Ignore unused sources for pipeline
Copy schema id to records before letting cache handle it
Fix sql parsing panic and statements being silently ignored
Need ignore_pipeline_branch_filters to trigger when pushes are disabled
OnEvent is not propagating data in some scenarios with postgres
Don't pass commit message to BuildKite.
Update issue template config with parent item

New Contributors

Full Changelog: v0.1.4...v0.1.5

Development Release - dev

18 Jan 09:25
b508e65
Compare
Choose a tag to compare
Pre-release

Bug Fixes

  • Improve performance of date parsing (#2306)
  • Remove ApiEndpoint::name to be consistent with other kinds of sinks (#2316)
  • Log aerospike sink error and increase batch timeout (#2336)
  • Dozer-core now propagates recoverable errors instead of panicking (#2348)
  • Don't bail out if log manager can't be started (#2365)
  • Read full message in aerospike tcp server (#2368)
  • Fix table indexes usage in aerospike connector (#2370)
  • Avoid sending repeated messages when error happens in Oracle log miner (#2373)
  • Fix oracle connector update parser (#2386)
  • i8 -> c_char (#2410)
  • Fix dummy sink performance output (#2411)
  • Oracle source skips uncommitted operations that are before another commit (#2413)
  • Oracle source replication dead loop (#2416)
  • Oracle redo log can have NULL REDO_SQL (#2418)
  • Aggregate transactions during replication to avoid losing part of long running transactions (#2423)

Features

  • Enable postgres continue ingestion (#2290)
  • Snowflake sink (#2266)
  • Support different kinds of endpoints (#2294)
  • Implement batch processing for stateless processors (#2319)
  • Continue mysql ingestion (#2322)
  • Webhook-connector (#2355)
  • Clickhouse sink (#2353)
  • Oracle connector snapshotting (#2359)
  • Oracle replication (#2361)
  • Aerospike ingestion (#2366)
  • Use http1 server for aerospike (#2374)
  • Dozer ui and dozer ui update (#2376)
  • Stop ingestion after number of events in dummy sink (#2394)
  • Print average latency in dummy sink (#2395)
  • Allow aerospike sink to handle multiple tables (#2393)
  • Support integer on oracle ingestion/sink (#2419)

Fix

  • Revert oracle resumability and fix snapshot data insert (#2425)

Miscellaneous Tasks

  • Move last_checkpoint from Source::start to SourceFactory::build (#2264)
  • Add latency measuring code in dummy sink (#2318)
  • Diable stateful output port (#2330)
  • Adapt UI for multiple tables in 1 sink (#2408)
  • Add dev container (#2412)
  • Stop using larger runners (#2417)
  • Add oracle instant client library to docker image (#2421)
  • Clean up dozer_sql::builder (#2424)
  • Remove clones and duplicated code in dozer_sql::builder (#2428)
  • Update version number to 0.4.0 (#2432)

Performance

  • Stop using record store (#2307)
  • Add Operation::BatchInsert (#2308)

Refactor

  • Change OpIdentifier to RestartableState (#2263)
  • Asyncify Source (#2329)
  • Make source state source wide instead of per table (#2344)
  • Send source state and op id to sink (#2354)
  • Make connector methods take unique reference (#2358)
  • Remove dozer-lambda (#2380)
  • Respect source transaction boundaries (#2335)
  • Handle schema and types in aerospike (#2375)
  • Remove dozer-api (#2388)
  • Guarentee operation order inside a transaction (#2389)
  • Remove dozer-cache (#2390)

Support

Contact us at https://getdozer.io