Add community indexer #10680

rbennettcw · 2025-01-23T00:30:47Z

Link to Issue

Closes: #10618

TODO:

Verify community search works
Verify pagination performance
Verify clanker community metadata

Description of Changes

Adds standalone script to fetch all current clanker tokens and create a community for each
Adds cron job + community indexer policy which fetches clanker tokens and creates a community for each token found

Test Plan

Run migrations
Set envs:
- COMMUNITY_INDEXER_CRON='* * * * *'
- MAX_CLANKER_BACKFILL=1000
Run backfill script: pnpm backfill-clanker-tokens
- It should create 1000 new communities from the latest clanker tokens
Run message relayer and consumer – it should pull in the latest tokens every minute
- Check clanker.world home page to confirm that latest tokens show in logs every minute

Deployment Plan

Set envs:
- MAX_CLANKER_BACKFILL=0 – will backfill all tokens
Run the pnpm backfill-clanker-tokens script on a new heroku instance
When finished, set env:
- COMMUNITY_INDEXER_CRON='0 * * * *' – indexer will trigger every hour

Other Considerations

N/A

libs/model/src/policies/CommunityIndexer.policy.ts

Rotorsoft · 2025-01-23T22:12:36Z

libs/schemas/src/entities/community-indexer.schemas.ts

@@ -0,0 +1,25 @@
+import { z } from 'zod';
+
+export const CommunityIndexer = z.object({


do we need a new model for this? Looks like a cache for a retry utils, probably better in redis

It's state, but I wouldn't say it's a cache.

There are 50K clanker tokens to initially fetch, so there should be a robust way of tracking it. Also, there will be multiple community indexers in the near future for different sources. For each indexer, it'll fetch many tokens initially, then periodically fetch the newest tokens.

It works pretty much exactly the way the evm listener tracks the last block that it polled.

RE: using Redis

After thinking about it more, my main concern there is a lack of enforced typing and migrations. Right now, we just store the watermark and status, but if it ever becomes more sophisticated than that (which there's a good chance it will considering how quickly requirements change), we don't have a framework for migrating the cache– or I guess we'd just destroy the cache and refetch everything, which is going to eventually be 100K+ tokens from clanker + pump.fun– the choice is between being error-prone or inefficient.

Storing in postgres is a bit weird since it's infra state and not model state, but it's the best option for building something robust that handles future needs. The next best thing would be to have a separate PG DB for these things but that's overkill.

packages/commonwealth/server/bindings/bootstrap.ts

rbennettcw · 2025-01-27T23:54:10Z

Still need to fix the tags and image upload.

dillchen · 2025-02-03T21:13:31Z

let's wait to merge / deploy this until the other community homepage (product side tickets are merged in) because the communities won't be useful until then

rbennettcw · 2025-02-12T18:04:06Z

I have a bunch of conflicts to fix, but the main idea is there.

@timolegros before you dig into this– it's worth noting that the clanker API doesn't allow you to jump to a specific ID/timestamp or any sort of watermark. You can jump to an arbitrary page number, but that's useless if you don't know what on those pages. So the implementation reflects that limitation.

rbennettcw added 3 commits January 22, 2025 16:29

community indexer WIP

e7f65d4

add clanker fetch logic to community indexer

5de265d

wire up rabbitmq config

014ddaa

dillchen added this to the Community Homepage milestone Jan 23, 2025

tweaks and fixes

2e02ffd

rbennettcw requested a review from Rotorsoft January 23, 2025 22:03

Rotorsoft reviewed Jan 23, 2025

View reviewed changes

libs/model/src/policies/CommunityIndexer.policy.ts Outdated Show resolved Hide resolved

tweak default

e81461a

Rotorsoft reviewed Jan 23, 2025

View reviewed changes

libs/model/src/policies/CommunityIndexer.policy.ts Outdated Show resolved Hide resolved

Rotorsoft reviewed Jan 23, 2025

View reviewed changes

packages/commonwealth/server/bindings/bootstrap.ts Show resolved Hide resolved

rbennettcw added 11 commits January 24, 2025 11:35

cleanup retry logic

9defbed

create seperate backfill script

414b946

add optional max tokens constraint

3182891

make idempodent

e403ef3

merge master + tweak timestamp

09d379d

allow indexer to be disabled

f1b6ede

warn

1b050b5

handle image token upload

7a35c20

use node cron

b9a695d

comment

84a6cb0

add fk + index for community indexer

c7e58ae

rbennettcw marked this pull request as ready for review January 27, 2025 23:53

rbennettcw requested a review from kurtassad January 27, 2025 23:53

rbennettcw requested a review from Rotorsoft January 27, 2025 23:54

dillchen linked an issue Jan 31, 2025 that may be closed by this pull request

Clanker Indexing (stub) #10656

Open

mzparacha linked an issue Jan 31, 2025 that may be closed by this pull request

Generating Community Page #10378

Open

kurtassad approved these changes Jan 31, 2025

View reviewed changes

fix clanker id and name conflicts for backfill

771455f

rbennettcw added 3 commits February 3, 2025 12:40

improve watermarking

d8711f1

lint

df2ea65

lint

c3f7eea

rbennettcw requested a review from timolegros February 12, 2025 18:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add community indexer #10680

Add community indexer #10680

rbennettcw commented Jan 23, 2025 •

edited

Loading

Rotorsoft Jan 23, 2025

rbennettcw Jan 23, 2025

rbennettcw Jan 27, 2025

rbennettcw commented Jan 27, 2025

dillchen commented Feb 3, 2025

rbennettcw commented Feb 12, 2025

		@@ -0,0 +1,25 @@
		import { z } from 'zod';

		export const CommunityIndexer = z.object({

Add community indexer #10680

Are you sure you want to change the base?

Add community indexer #10680

Conversation

rbennettcw commented Jan 23, 2025 • edited Loading

Link to Issue

Description of Changes

Test Plan

Deployment Plan

Other Considerations

Rotorsoft Jan 23, 2025

Choose a reason for hiding this comment

rbennettcw Jan 23, 2025

Choose a reason for hiding this comment

rbennettcw Jan 27, 2025

Choose a reason for hiding this comment

rbennettcw commented Jan 27, 2025

dillchen commented Feb 3, 2025

rbennettcw commented Feb 12, 2025

rbennettcw commented Jan 23, 2025 •

edited

Loading