Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

273 patch partitioningspartidparents #302

Open
wants to merge 7 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from 4 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .github/CODEOWNERS
Original file line number Diff line number Diff line change
@@ -1 +1 @@
* @benedeki @lsulak @TebaleloS @Zejnilovic @dk1844 @salamonpavel
* @benedeki @lsulak @TebaleloS @Zejnilovic @dk1844 @salamonpavel @ABLL526
7 changes: 7 additions & 0 deletions database/README.md
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can be ignored.
For my use.

Original file line number Diff line number Diff line change
Expand Up @@ -32,3 +32,10 @@ to remove them or
sbt flywayBaseline
```
to set the current state as the baseline.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This exact information is already in the readme file. Remove it.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @salamonpavel will remove the file from the PR.

```zsh
docker kill atum_db
docker rm atum_db
docker run --name=atum_db -e POSTGRES_PASSWORD=postgres -e POSTGRES_DB=atum_db -p 5432:5432 -d postgres:16
sbt flywayMigrate
```
141 changes: 141 additions & 0 deletions database/src/main/postgres/runs/V0.3.0.1__get_ancestors.sql
Original file line number Diff line number Diff line change
@@ -0,0 +1,141 @@
/*
* Copyright 2021 ABSA Group Limited
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
* You may obtain a copy of the License at
*
* http://www.apache.org/licenses/LICENSE-2.0
*
* Unless required by applicable law or agreed to in writing, software
* distributed under the License is distributed on an "AS IS" BASIS,
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
* See the License for the specific language governing permissions and
* limitations under the License.
*/

CREATE OR REPLACE FUNCTION runs.get_ancestors(
ABLL526 marked this conversation as resolved.
Show resolved Hide resolved
IN i_partitioning_id BIGINT,
ABLL526 marked this conversation as resolved.
Show resolved Hide resolved
IN i_limit INT DEFAULT 5,
IN i_offset BIGINT DEFAULT 0,
ABLL526 marked this conversation as resolved.
Show resolved Hide resolved
OUT status INTEGER,
OUT status_text TEXT,
OUT ancestor_id BIGINT,
OUT partitioning JSONB,
OUT author TEXT,
OUT has_more BOOLEAN
) RETURNS SETOF record AS
$$
-------------------------------------------------------------------------------
--
-- Function: runs.get_ancestors(1)
ABLL526 marked this conversation as resolved.
Show resolved Hide resolved
-- Returns Ancestors' partition ID for the given id
--
-- Parameters:
-- i_partitioning_id - id that we asking the Ancestors for
-- i_limit - (optional) maximum number of partitionings to return, default is 5
-- i_offset - (optional) offset to use for pagination, default is 0
--
-- Returns:
-- status - Status code
-- status_text - Status message
-- ancestor_id - ID of Ancestor partition
-- partitioning - partitioning data of ancestor
-- author - author of the Ancestor partitioning
-- has_more - Flag indicating if there are more partitionings available

-- Status codes:
-- 11 - OK
-- 41 - Partitioning not found
--
-------------------------------------------------------------------------------
DECLARE
flow_id BIGINT[];
var BIGINT;
partitionCreateAt TIMESTAMP;
_has_more BOOLEAN;

BEGIN

PERFORM 1 FROM runs.partitionings WHERE id_partitioning = i_partitioning_id;
IF NOT FOUND THEN
status := 41;
status_text := 'Child Partitioning not found';
ABLL526 marked this conversation as resolved.
Show resolved Hide resolved
has_more := FALSE;
ABLL526 marked this conversation as resolved.
Show resolved Hide resolved
RETURN NEXT;
RETURN;
END IF;

SELECT created_at FROM runs.partitionings WHERE id_partitioning = i_partitioning_id INTO partitionCreateAt;

flow_id := array(
SELECT fk_flow AS flow_id
FROM flows.partitioning_to_flow
WHERE fk_partitioning = i_partitioning_id
);

IF cardinality(flow_id) = 0
THEN
status := 41;
status_text := 'Flow not found';
has_more := FALSE;
RETURN NEXT;
RETURN;
END IF;

FOREACH var IN ARRAY flow_id LOOP
IF i_limit IS NOT NULL THEN
SELECT count(*) > i_limit
FROM flows.partitioning_to_flow PTF
WHERE PTF.fk_flow = var
LIMIT i_limit + 1 OFFSET i_offset
INTO _has_more;
ELSE
_has_more := false;
END IF;
END LOOP;

RETURN QUERY
SELECT
11 AS status,
'OK' AS status_text,
P.id_partitioning AS ancestor_id,
P.partitioning AS partitioning,
P.created_by AS author,
_has_more AS has_more
FROM
runs.partitionings P
INNER JOIN
flows.partitioning_to_flow PF
ON PF.fk_partitioning = P.id_partitioning
INNER JOIN
(SELECT * FROM flows.partitioning_to_flow OLD_PF WHERE OLD_PF.fk_partitioning = i_partitioning_id) PF2
ON Pf2.fk_flow = PF.fk_flow
WHERE
P.id_partitioning < i_partitioning_id
AND
P.created_at <= partitionCreateAt
GROUP BY P.id_partitioning
ORDER BY
P.id_partitioning,
P.created_at DESC
LIMIT i_limit
OFFSET i_offset;

IF FOUND THEN
status := 11;
status_text := 'OK';
ELSE
status := 41;
ABLL526 marked this conversation as resolved.
Show resolved Hide resolved
status_text := 'Ancestor Partitioning not found';
has_more := FALSE;
END IF;
RETURN NEXT;
RETURN;

END;
$$
LANGUAGE plpgsql VOLATILE SECURITY DEFINER;

ALTER FUNCTION runs.get_ancestors(BIGINT, INT, BIGINT) OWNER TO atum_owner;
GRANT EXECUTE ON FUNCTION runs.get_ancestors(BIGINT, INT, BIGINT) TO atum_user;
Original file line number Diff line number Diff line change
@@ -0,0 +1,98 @@
/*
* Copyright 2021 ABSA Group Limited
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
* You may obtain a copy of the License at
*
* http://www.apache.org/licenses/LICENSE-2.0
*
* Unless required by applicable law or agreed to in writing, software
* distributed under the License is distributed on an "AS IS" BASIS,
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
* See the License for the specific language governing permissions and
* limitations under the License.
*/

CREATE OR REPLACE FUNCTION runs.patch_partitioning_parent(
Copy link
Collaborator

@salamonpavel salamonpavel Nov 29, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am not sure about what the function does and is supposed to do. I thought we should check that partitioning and parent partitioning exists and then simply add the relationship (if not already there) using _add_to_parent_flows.sql ... @benedeki @lsulak

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The function should set the relationship between the two partitions, considering the conditions are set. And based on input parameter(s) copy the metadata from parent to child.
Conditions to check:

  • partition exists
  • parent partition exists
  • the relationship does not exist already (then it's not an error, just no-op)
  • a reverse relationship does not exist
    Metadata to copy, if input parameters says so:
  • measures
  • additional data

IN i_partitioning_id BIGINT,
IN i_parent_id BIGINT,
IN i_by_user TEXT,
OUT status INTEGER,
OUT status_text TEXT,
OUT parent_id BIGINT
) RETURNS record AS
ABLL526 marked this conversation as resolved.
Show resolved Hide resolved
$$
-------------------------------------------------------------------------------
--
-- Function: runs.patch_partitioning_parent(1, 2, user_1)
-- Returns Parents' partition ID for the given id.
-- Updates the Parent Partition for a given partition ID.

--
-- Parameters:
-- i_partitioning_id - id of the partition to be changed
-- i_parent_id - id of the new parent of the partition,
-- i_by_user - user behind the change
--
-- Returns:
-- status - Status code
-- status_text - Status message
-- parent_id - ID of Parent partition

-- Status codes:
-- 11 - OK
-- 41 - Partitioning not found
--
-------------------------------------------------------------------------------
DECLARE
flow_id BIGINT[];
mainFlow BIGINT;
var BIGINT;

BEGIN

PERFORM 1 FROM runs.partitionings WHERE id_partitioning = i_partitioning_id;
IF NOT FOUND THEN
status := 41;
status_text := 'Child Partitioning not found';
RETURN;
END IF;

PERFORM 1 FROM runs.partitionings WHERE id_partitioning = i_parent_id;
IF NOT FOUND THEN
status := 41;
status_text := 'Parent Partitioning not found';
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same here, we could distinguish was was not found.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Agreed

RETURN;
END IF;

SELECT F.id_flow as mainFlow
FROM runs.get_partitioning_main_flow(i_partitioning_id) AS F
INTO mainFlow;

flow_id := array(
SELECT fk_flow AS flow_id
FROM flows.partitioning_to_flow
WHERE fk_partitioning = i_partitioning_id
AND fk_flow != mainFlow
);

FOREACH var IN ARRAY flow_id LOOP
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

loops could be removed

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Work with Arrays in PG is nto very effective. It's discouraged to operate on them, only use them as input or output.
Particularly bad is to do a FOR loop over them.

DELETE FROM flows.partitioning_to_flow AS PTF
WHERE PTF.fk_partitioning = i_partitioning_id
AND PTF.fk_flow = var;
END LOOP;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

While you are doing this. It's not needed, IMHO.


PERFORM 1 FROM flows._add_to_parent_flows(i_parent_id, i_partitioning_id, i_by_user);
status := 11;
status_text := 'Parent Updated';
parent_id := i_parent_id;

RETURN;

END;
$$
LANGUAGE plpgsql VOLATILE SECURITY DEFINER;

ALTER FUNCTION runs.patch_partitioning_parent(BIGINT, BIGINT, TEXT) OWNER TO atum_owner;
GRANT EXECUTE ON FUNCTION runs.patch_partitioning_parent(BIGINT, BIGINT, TEXT) TO atum_user;
Loading