Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Allow nested arrays and vectors in Brillig foreign calls #4404

Closed

Conversation

ggiraldez
Copy link
Contributor

Description

This is a mirror of noir-lang/noir#4053. From the discussion there, it was suggested that we should open the PR on this repository instead, due to the changes in the serialization format.

Problem*

Resolves noir-lang/noir#4052

The Brillig VM has no knowledge of the data types of array/slices elements, but it also references arrays and vectors by a pointer to a small structure in memory. This results in incorrect data passed to foreign functions when the values consist of arrays (or slices) nested inside other arrays (or slices).

Summary*

⚠️ THIS PR CHANGES ACIR SERIALIZATION FORMAT

This PR adds element type descriptors to HeapArray and HeapVector Brillig structures to allow the code that gathers the foreign call parameters to correctly interpret the data to be read from the VM memory.

Additional Context

The counterpart of is returning nested arrays/slices from foreign calls, which this PR does not implement. The existing foreign functions (and the ones required by the debugger oracle) do not require this, so it's not currently a problem.

Furthermore, implementing that feature would require some design decisions regarding how are the arrays/slices allocated in memory and by who.

To return arrays the compiler now generates code to pre-allocate the heap array, adjusting the heap pointer register and setting a register for the VM to write the results into memory. For nested arrays, it would need to pre-allocate the outer as well as the inner arrays and write the pointers to the inner arrays into memory, all before making the foreign call.

When adding slices (heap vectors) to the mix, it gets more complicated because they are dynamically sized, so it wouldn't be possible for the compiler to pre-allocate the inner arrays/slices. Also, for slices the compiler will allocate a register for the VM to write the size of the returned result, which it uses to generate code to adjust the heap pointer register after the call is complete.

It's unclear whose responsibility would be to allocate heap space when mixing arrays and slices, and the compiled code should handle it. A simpler approach may be for the compiler to generate the foreign call passing the heap pointer register and delegating all the allocations and heap pointer adjustments to the VM implementation. That would simplify generated Brillig code and move the burden on to the VM implementation.

Documentation*

Check one:

  • No documentation needed.
  • Documentation included in this PR.
  • [Exceptional Case] Documentation to be submitted in a separate PR.

PR Checklist*

  • I have tested the changes locally.
  • I have formatted the changes with Prettier and/or cargo fmt on default settings.

@sirasistant
Copy link
Collaborator

I think acir.hpp in barretenberg needs to be updated with the changes in acir.cpp. Keep in mind that you'll need to replace the throw serde::deserialization_error(... with throw_or_abort(...

ForeignCallParam::Single(value) => {
self.memory.write(*value_index, *value);
ValueOrArray::MemoryAddress(value_index) => {
assert!(*value_type == HeapValueType::Simple);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Rather than having these asserts on value_type couldn't we match against (destination, value_type) and then have one failure case for the match statement about having a mismatched type.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah, that's a good idea! Thanks!

match input {
ValueOrArray::MemoryAddress(value_index) => self.memory.read(value_index).into(),
ValueOrArray::MemoryAddress(value_index) => {
assert!(*value_type == HeapValueType::Simple);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same comment about the asserts on value_type as in process_opcode

@vezenovm
Copy link
Contributor

vezenovm commented Feb 5, 2024

Some minor comments, overall looks ready to merge after the serialization is updated.

Addressing the code review comments, match on value and type simultaneously
instead of just the value and then asserting on the type.
@ggiraldez ggiraldez requested a review from vezenovm February 5, 2024 21:01
Comment on lines 1115 to 1120
std::vector<Circuit::Opcode> opcodes;
Circuit::ExpressionWidth expression_width;
std::vector<Circuit::Witness> private_parameters;
Circuit::PublicInputs public_parameters;
Circuit::PublicInputs return_values;
std::vector<std::tuple<Circuit::OpcodeLocation, std::string>> assert_messages;
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this changes need to be removed! Also it's missing passing the formatter on the whole file

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To clarify, it should look like the previous diff without the Circuit:: prefixes.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmm... I thought this would have the same contents as the generated acir.cpp with the previously mentioned replacement only. Do you have the procedure to update this file documented somewhere?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@ggiraldez unfortunately the documentation on this process is lacking. The replacement mentioned here is the only other place. I'm going to quickly do the serialization for you and also make an issue to document this process.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think I was able to fix this one, and found the format.sh script in the directory. I'll push a commit with the changes, but I'm unsure it's working properly. Locally, I managed to get the build dependencies installed for Barrentenberg, but it's still complaining about a different thing somewhere else in the codebase. I'm sure I'm missing some language dependency or something.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ok the committed changes look correct. Let's see what CI says. I will document the process for any future updates

@vezenovm vezenovm enabled auto-merge (squash) February 6, 2024 18:30
auto-merge was automatically disabled February 6, 2024 20:35

Head branch was pushed to by a user without write access

@vezenovm vezenovm enabled auto-merge (squash) February 6, 2024 20:38
@TomAFrench TomAFrench disabled auto-merge February 7, 2024 09:28
Copy link
Member

@TomAFrench TomAFrench left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Merging this is going to delay us being able to sync the repos until we have another release of aztec-packages. If we're not going to do an extra early release then we should hold off on this.

Copy link
Member

@TomAFrench TomAFrench left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removing block as we're going to do an extra release.

sirasistant added a commit that referenced this pull request Feb 7, 2024
This PR mirrors the changes of
#4404 to make them
easier to merge

---------

Co-authored-by: Gustavo Giráldez <[email protected]>
@sirasistant
Copy link
Collaborator

Merged here #4478

@ggiraldez
Copy link
Contributor Author

Merged here #4478

Thank you!

@ggiraldez ggiraldez closed this Feb 7, 2024
TomAFrench pushed a commit that referenced this pull request Feb 7, 2024
This PR mirrors the changes of
#4404 to make them
easier to merge

---------

Co-authored-by: Gustavo Giráldez <[email protected]>
AztecBot pushed a commit to AztecProtocol/barretenberg that referenced this pull request Feb 8, 2024
This PR mirrors the changes of
AztecProtocol/aztec-packages#4404 to make them
easier to merge

---------

Co-authored-by: Gustavo Giráldez <[email protected]>
michaelelliot pushed a commit to Swoir/noir_rs that referenced this pull request Feb 28, 2024
…Protocol#4478)

This PR mirrors the changes of
AztecProtocol#4404 to make them
easier to merge

---------

Co-authored-by: Gustavo Giráldez <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

Cannot pass nested arrays/slices to foreign functions
4 participants