[Iter-X] 52/100days

Note: This article was translated for me by AI. I wrote the original in Chinese. I never use AI to write my articles, because that would cost me my own expression; my freedom to express myself is always the most valuable part of my work. So if you can read Chinese, I recommend reading the Chinese version, where you will get the most original and unfiltered version. That said, technological progress exists to give us more convenience, so I will continue using AI to translate my writing into multiple languages, allowing valuable content to reach more people.

Day5️⃣2️⃣

We had an online meeting today to discuss the logic behind data cleaning. The scale of the collected POI (Points of Interest) data is quite large. A complete POI includes basic information such as name, description, address, and coordinates, as well as some value-added information like recommended duration of visit, rating, etc. Some of this data can be collected at scale from public datasets, while the rest needs to be supplemented through other means:

1️⃣ Due to the volume, manual work is out of the question initially (although it’s worth noting that manual input can be a viable method—especially at full user scale, users themselves act as data curators, which is the approach many mature platforms take. User-maintained data is often the most accurate).

2️⃣ The conventional approach is using web crawlers to gather data from various sources. Some platforms provide this type of data.

3️⃣ Nowadays, a lot of this can actually be done with LLMs. We’ve estimated that supplementing one POI consumes roughly 1,000 tokens, so even at the scale of hundreds of thousands of POIs, the cost remains quite low.

There’s a lot of nuance in this process, but for now, we’re focusing on the MVP. Optimization can come later.

Today, we finally integrated all the Agents, Tools, and Prompts into the database. This will make it easier to manage and switch between different versions. Next, we’ll build a simple admin dashboard for easy management and prompt tuning. We also plan to create a Trajectory Visualization interface for visualizing the Agent call chains, which will make debugging and tracking much more convenient.

Current Progress Overview:

  1. Prototype Design & UI/UX: 33%
  2. Backend (Go) Development: 50%
  3. Client-side (Flutter) Progress: 42%
  4. Data: 12%

If you think you match the following, we’d love to talk:

  1. Persistent
  2. Dream-driven
  3. Interested



Enjoy Reading This Article?

Here are some more articles you might like to read next:

  • LLM Infra 101 v0.0: Model Inference
  • Daily Harness
  • Browser Use Deep Dive
  • The Primitive Urge to Solve Problems
  • How We Are Speedrunning the AI Era