17 KiB
description |
---|
Aqua implementation of Fluence Registry and ResourcesAPI |
@fluencelabs/registry
Fluence Registry is an essential part of the Fluence network protocol. It provides a Resources API that can be used for service advertisement and discovery. For more details check out our community call.
Releases
You can find the latest registry
release on NPM and the changelogs are in the GitHub repo.
API
For the API implementation, take a look at resources-api.aqua in the registry
repo.
Terminology
- Registry - a service that provides low-level API.
resources-api.aqua
is built on top of it. - @fluencelabs/registry - an Aqua library on NPM that provides high-level and low-level APIs to develop custom registry scripts.
- Resource/Provider - a pattern for peer/service advertisement and discovery. Providers register for a Resource that can be discovered by its resource_id.
- Kademlia - an algorithm for structuring a peer-to-peer network so that peers can find each other efficiently, i.e., in no more than O(logN) hops where N is the total number of peers in the network.
- Resource – a
string
label with associatedowner_peer_id
and a list of providers. A resource should be understood as a group of services or a group of peers united by some common feature. In low-level API Resource is a Registry key. - Registry Key - a structure, signed by
owner_peer_id
, which holds information about Resource:
data Key:
id: string
label: string
owner_peer_id: string
timestamp_created: u64
challenge: []u8
challenge_type: string
signature: []u8
- resource_id - a stable identifier created from the hash of
label
andowner_peer_id
used to identify any resource.id
field of RegistryKey
- Resource owner - the
owner_peer_id
that created the resource. Other users can create resources with the same label but the identifier will be different because of theowner_peer_id.
- challenge/challenge_type – dummy fields which will be used for permission management in the next Registry iterations.
- Provider – a peer which is registered as a resource provider, optionally with an associated relay_id and service_id. Each provider is associated with a Registry record.
- Registry record - a structure, signed by
set_by
peer_id, which holds information about Provider:
data Record:
key_id: string
value: string
peer_id: string
set_by: string
relay_id: []string
service_id: []string
timestamp_created: u64
solution: []u8
signature: []u8
- provider's value – any string which can be defined and used in accordance with protocol requirements.
- provider's peer_id – peer id of provider of this resource.
- provider's relay_id - optional, set if provider is available on the network through this relay.
{% hint style="info" %}
When a provider doesn't have a publicly accessible IP address, e.g. the client peer is a browser, it connects to the network through a relay node. That means that other peers only can connect to this provider through a relay. In that case,registerProvider implicitly set relay_id to
HOST_PEER_ID
.
{% endhint %}
- provider's service_id - optional, id of the service of that provider.
- solution – dummy field, will be used for permission checking in the next Registry iterations.
- provider limit - a resource can have at most 32 providers. Each new provider added after the provider limit has been reached results in removing an old provider following the FIFO principle. Soon provider's prioritization will be handled by TrustGraph.
- host provider's record - a Registry record with
peer_id
of a node. When a node is registered as a provider viaregisterNodeProvider
orcreateResourceAndRegisterNodeProvider
, the record is a host record. Host records live through garbage collection, unlike other Registry records. See Register As A Provider for details. - resource and provider lifetime - a resource and provider record are **** republished every **** 1 hour and evicted, i.e. removed, 24 hours after being unused.
{% hint style="info" %} So there are two types of providers. First is a node provider which lifetime controlled by this node. Second is a JS peer provider and should be renewed periodically by this peer. {% endhint %}
- script caller - a peer that executes a script by sending it to the network. In Aqua it's
INIT_PEER_ID
- node - usually a Fluence node hosted by the community or Fluence Team. Nodes are long-lived, can host WebAssembly services and participate in the Kademlia network.
How To Use Registry
{% hint style="info" %}
There are several simple examples in the fluencelabs/registry
repo. Give them a look.
{% endhint %}
Create A Resource
Before registering as a provider is possible, resource must be created. That's exactly what createResource
does.
Here's a simple Aqua example:
import "@fluencelabs/registry/resources-api.aqua"
import "@fluencelabs/aqua-lib/builtin.aqua"
func my_function(label: string) -> ?string, *string:
resource_id, errors <- createResource(resource)
if resource_id != nil:
-- resource created successfully
Op.noop()
else:
-- resource creation failed
Op.noop()
<- resource_id, errors
Register As A Provider
There are four functions that register providers. Let's review them.
These you would use for most of your needs:
registerProvider
- registersINIT_PEER_ID
as a provider for existent resource.createResourceAndRegisterProvider
- creates a resource first and then registersINIT_PEER_ID
as a provider for it.
And these are needed to register a node provider for a resource:
registerNodeProvider
- registers the given node as a provider for an existing resource.createResourceAndRegisterNodeProvider
- creates a resource first and then registers the given node as a provider.
Now, let's review them in more detail.
createResourceAndRegisterProvider
& registerProvider
These functions register the caller of a script as a provider:
createResourceAndRegisterProvider
creates a resource prior to registrationregisterProvider
simply adds a registration as a provider for existing resource.
createResourceAndRegisterNodeProvider
& registerNodeProvider
These two functions work almost the same as their non-Node
counterparts, except that they register a node instead of a caller. This is useful when you want to register a service hosted on a node.
Records created by these two functions live through garbage collection unlike records created by registerProvider.
Here's how you could use it in TypeScript:
{% hint style="info" %}
You first need to have export.aqua
file and compile it to TypeScript, see here
{% endhint %}
import {Fluence, KeyPair} from "@fluencelabs/fluence";
import { krasnodar } from "@fluencelabs/fluence-network-environment";
import {registerNodeProvider, createResource, registerProvider, resolveProviders} from "./generated/export";
import assert from "assert";
async function main() {
// create the first peer and connect it to the network
await Fluence.start({ connectTo: krasnodar[1] });
console.log(
"📗 created a fluence peer %s with relay %s",
Fluence.getStatus().peerId,
Fluence.getStatus().relayPeerId
);
let label = "myLabel";
console.log("Will create resource with label:", label);
let [resource_id, create_error] = await createResource(label);
assert(resource_id !== null, create_error.toString());
console.log("resource %s created successfully", resource_id);
let value = "myValue";
let node_provider = krasnodar[4].peerId;
let service_id = "identity";
let [node_success, reg_node_error] = await registerNodeProvider(node_provider, resource_id, value, service_id);
assert(node_success, reg_node_error.toString());
console.log("node %s registered as provider successfully", node_provider);
let [success, reg_error] = await registerProvider(resource_id, value, service_id);
console.log("peer %s registered as provider successfully", Fluence.getStatus().peerId);
assert(success, reg_error.toString());
let [providers, error] = await resolveProviders(resource_id, 2);
// as a result we will see two providers records
console.log("resource providers:", providers);
}
main().then(() => process.exit(0))
.catch(error => {
console.error(error);
process.exit(1);
});
Renew Record Periodically
After a non-host record is created, it must be used at least once an hour to keep it from being marked stale and deleted. Also, peers must renew themselves at least once per 24 hours to prevent record expiration and deletion.
While this collection schedule may seem aggressive, it keeps the Registry up-to-date and performant as short-lived client-peers, such as browsers, can go offline at any time or periodically change their relay nodes.
Call A Function On Resource Providers
executeOnProviders
registry
provides a function to callback on every Record associated with a resource:
func executeOnProviders(resource_id: string, ack: i16, call: Record -> ())
It reduces boilerplate when writing an Aqua script that calls a (common) function on each provider. For example:
import "@fluencelabs/registry/resources-api.aqua"
-- API that every subscriber must adhere to
-- You can think of it as an application protocol
service ProviderAPI:
do_smth(value: string)
func call_provider(p: Record):
-- topological move to provider via relay
on p.peer_id via p.relay_id:
-- resolve service on a provider
ProviderAPI p.service_id
-- call function
ProviderApi.do_smth(p.value)
-- call ProviderApi.do_smth() on every provider
func call_everyone(resource_id: String, ack: i16):
executeOnProviders(resource_id, ack, call_provider)
Passing Data To Providers
Due to the limitations in callbacks, executeOnProviders
doesn't allow us to send dynamic data to providers. However, this limitation is easily overcome by using a for
loop:
Consider this Aqua code:
import "@fluencelabs/registry/resources-api.aqua"
-- Application event
data Event:
value: string
-- API that every provider must adhere to
-- You can think of it as an application protocol
service ProviderAPI:
receive_event(event: Event)
func call_provider(p: Record, event: Event):
-- topological move to provider via relay
on p.peer_id via p.relay_id:
-- resolve service on a provider
ProviderAPI p.service_id
-- call function
ProviderAPI.receive_event(event)
-- send event to every provider
func send_everyone(resource_id: String, ack: i16, event: Event):
-- retrieve all providers of a resource
providers <- resolveProviders(resource_id, ack)
-- iterate through them
for p <- providers par:
call_provider(p, event)
Handling Function Calls
Fluence JS SDK allows JS/TS peers to define their API through services and functions.
Let's take the ProviderApi
from the previous example and extend it a little:
data Event:
value: string
service ProviderAPI:
-- receive an event
receive_event(event: Event)
-- do something and return data
do_something(value: string) -> u64
Let's save this file to provider_api.aqua
and compile it
aqua -i . -o src/generated
import { Fluence } from "@fluencelabs/fluence";
import { krasnodar } from "@fluencelabs/fluence-network-environment";
import { registerProviderAPI, ProviderAPIDef } from "./generated/provider_api"
async function main() {
await Fluence.start({ connectTo: krasnodar[2] });
let service_id = 'api';
let counter = 0;
await registerProviderAPI(service_id, {
receive_event: (event: any) => {
console.log("event received!", event);
},
do_something: (value: any) => {
counter += 1;
console.log("doing logging!", value, counter);
return counter;
}
});
}
main().then(() => process.exit(0))
.catch(error => {
console.error(error);
process.exit(1);
});
Overcoming The Record Limit
If your app requires more than 32 providers for a single resource, then it's time to think about a custom WebAssembly service that stores all these records. Basically a simple "records directory" service.
With such a service implemented and deployed, you can use resources-api.aqua
to register that "records directory" service and host as provider. Depending on your app's architecture, you might want to have several instances of "records directory" service.
The code to get all records from "directory" services might look something like this in Aqua:
import "@fluencelabs/registry/resources-api.aqua"
service RecDir:
get_records(resource_id: string) -> []Record
func dir_subscribers(resource_id: String, ack: i16) -> [][]Record:
-- this stream will hold all records
allRecs: *[]Record
-- retrieve RecDir records from Registry
providers <- resolveProviders(resource_id, ack)
-- iterate through all RecDir services
for dir <- providers:
on dir.peer_id:
RecDir dir.service_id
-- get all records from RecDir and write to allRecs
allRecs <- SubDir.get_records(resource_id)
<- allRecs
Concepts
Kademlia Neighborhood
Fluence nodes participate in the Kademlia network. Kademlia organizes peers in such a way that given any key, you can find a set of peers that are "responsible" for that key. That set contains up to 20 nodes.
That set is called "neighborhood" or "K-closest nodes" (K=20). In Aqua, it is accessible in aqua-lib
via the Kademlia.neighbourhood
function.
The two most important properties of the Kademlia neighborhood are:
1) it exists for any key
2) it is more or less stable
Data Replication
On write
When a registration as a provider for a resource is done, it is written to the Kademlia neighborhood of that resource_id. Here's a registerProvider
implementation in Aqua:
-- Register for a resource as provider
-- Note: resource must be already created
func registerProvider(resource_id: ResourceId, value: string, service_id: ?string) -> bool, *Error:
success: *bool
error: *Error
relay_id: ?string
relay_id <<- HOST_PEER_ID
t <- Peer.timestamp_sec()
on HOST_PEER_ID:
record_sig_result <- getRecordSignature(resource_id, value, relay_id, service_id, t)
if record_sig_result.success == false:
error <<- record_sig_result.error!
success <<- false
else:
record_signature = record_sig_result.signature!
-- find resource
key, error_get <- getResource(resource_id)
if key == nil:
appendErrors(error, error_get)
success <<- false
else:
successful: *bool
-- get neighbourhood for the resource_id
nodes <- getNeighbours(resource_id)
-- iterate through each node in the neighbourhood
for n <- nodes par:
error <<- n
on n:
try:
-- republish key/resource
republish_res <- republishKey(key!)
if republish_res.success == false:
error <<- republish_res.error
else:
-- register as a provider on each node in the neighbourhood
put_res <- putRecord(resource_id, value, relay_id, service_id, t, record_signature)
if put_res.success:
successful <<- true
else:
error <<- put_res.error
timeout: ?string
-- at least one successful write should be performed
join successful[0]
par timeout <- Peer.timeout(5000, "provider hasn't registered: timeout exceeded")
if timeout == nil:
success <<- true
else:
success <<- false
error <<- timeout!
<- success!, error
This ensures that data is replicated across several peers.
At rest
Resource Keys and Provider records are also replicated "at rest". That is, once per hour all stale keys and records are removed and replicated to all nodes in the neighborhood, once per day all expired keys and records are removed. ****
This ensures that even if a neighborhood for a resource_id has changed due to some peers go offline and others join the network, data will be replicated to all nodes in the neighborhood.
...
{% hint style="info" %} For advanced users accustomed to Aqua scripts:
There's an implementation of "at rest" replication for Registry on GitHub {% endhint %}