Replies: 5 comments 16 replies
-
We have two fields in the event that can potentially be used for this -
Yes, this can be done. My initial thought is to crawl for all of Lens and see the performance. If we hit significant problems then we can limit to only known musicians. CC: @Watcher-eth |
Beta Was this translation helpful? Give feedback.
-
Good call. No need to fix it if it isn't broken. |
Beta Was this translation helpful? Give feedback.
-
I have started stress testing the rough Lens implementation I have written. In around 5 hours Lens protocol sees 7000 posts. I feel that our crawler is robust, now that we have concurrency and retry functionalities in extraction-worker. So, given enough time we should be able to crawl Lens completely. Currently, it is taking 4 minutes to crawl 7000 posts. Therefore, it will take 12 mins to crawl all posts in a day and 6 hours to crawl all posts created in a month. This is slow. If someone was starting their crawl from the starting it will take 4-6 days, assuming we have to crawl a year worth of posts. Also worth noting that if someone does not use their own Arweave/IPFS gateway or a public gateway they will encounter a lot of costs. Right now, I am inclining towards the approach @neatonk proposed that is to limit crawl to known musicians. What do you all think? (@djfnd) |
Beta Was this translation helpful? Give feedback.
-
Hey guys great discussion. I agree no need to crawl all of lens with the
speed at how things Safe growing this is gonna her costly very fast!
You can actually even Index by App as the api and schemas have a source or
appID resolver that let’s you filter by app. In our case it would be
"beats" and our new one is "riff"
Any news on the feed strategy? Getting music chronologically based on an
array of wallet addresses.
Will talk with Dan later this week too.
Hope everyone is doing good.
Let me know if I can be of any assistance.
Watcher
…On Fri 12. May 2023 at 05:37, Dan Fowler ***@***.***> wrote:
That sounds like a good idea to me. I will spin up a thread on how to use
the money when we have a clear idea about exactly what it'll be, but I
reckon it'll be of the order of $2.5k. That said, as this is a Lens
specific thing, then I think there's also good rationale to put some of the
next month's tranch of Lens grant towards it.
—
Reply to this email directly, view it on GitHub
<#24 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AZXIWAC6A3CACUP5BOEKTTTXFYVHDANCNFSM6AAAAAAXUH2RDY>
.
You are receiving this because you were mentioned.Message ID:
***@***.***
com>
|
Beta Was this translation helpful? Give feedback.
-
FYI I have made a post in the lens dev tg group and linked to this discussion to seek some feedback and input |
Beta Was this translation helpful? Give feedback.
-
The scale of the Lens protocol is much greater than what we have seen with existing protocols such as Sound.
My proposed approach for crawling
We can listen for the PostCreated event. The event includes a content URI and upon visiting the link we get the metadata. The metadata has information if the post is audio or not. We have to download the metadata for each post.
Downloading the metadata and checking for each post will test our scaling capability.
I haven't checked if Lens API provides a better way to sort through posts. I have only looked through on-chain events as they are more preferable than centralized APIs.
Potential solutions to reduce information
Beta Was this translation helpful? Give feedback.
All reactions