memd — Prompt History¶
All structured prompts from this conversation, in order¶
Prompt 1 — Initial Situation and Goals¶
About Me: Ayush, a drop year student
Portfolio: ahhyoushh.github.io
Objective: Looking to make projects that will land an internship in research
labs, by cold mailing in interested fields. Targeting universities abroad.
Unclear about field to choose, not able to find research labs, and what's
the best in long term.
Interested fields:
Field: IoT, Avionics, Edge Avionics
Project idea: A UAV, making TX and receiver myself using ESP32 and SX modules,
LuckFox Pico Ultra or similar for object/entity tracking, making the PID
flight controller.
Fear: Too common, not much niche, not into aeronautics or mechanical engineering,
strictly ECE or CS. Also fears no good research labs will take a dropper.
Field: (less) EdgeAI, ML, Computer Vision
Project idea: A table viewer, views table from above, crops objects, generates
embeddings, semantic search through memory, using LuckFox or similar.
Fear: Too simple, not niche, not impressive, won't get upvotes, no credibility.
Job: Analyse situation, suggest better projects, which will get reach and be
interesting, not like a rover etc.
People I want to be like:
https://datavorous.github.io/
https://0xmukesh.github.io/
Prompt 2 — Clarifying Objective¶
I don't want foreign labs. I am applying to foreign unis after my gap year
so I need a research internship before that. Next fall 2027 I am applying for.
I find making a small inference runtime for YOLO or something, then using
that inference to make my table viewer the most interesting. I will also
like to build my own vector search library like spheni of datavorous too.
Give me a full plan, why you think fits the best, how to find research
papers and actually interesting research etc, how to become like Sagnik.
Prompt 3 — Does YOLO.C Already Exist?¶
Prompt 4 — Scoping the Project Stack¶
So I write my own inference, own quantisation and make my own CNN model?
We can skip vector search library then I guess or we can just brute force
and later implement?
Prompt 5 — ESP32 and NPU Question¶
If I could run it on ESP32 I really will want to. But isn't ESP32 CPU only?
What if I want to go into NPU?
Prompt 6 — Hardware Decision (Single Device)¶
Prompt 7 — Budget Expansion to 10k¶
Prompt 8 — Coral USB Clarification¶
Prompt 9 — What Do Research Labs Use?¶
Prompt 10 — Qualcomm and Lab Research¶
What about Qualcomm? Find me some research labs for me, tell me what they do,
what hardware they use, and pinpoint a good hardware for it.
Prompt 11 — ESP32 Inference Question¶
Prompt 12 — ESP32 Only Decision¶
Nah I can't find LuckFox, better do it all for ESP32. ESP32 is globally
recognised too, it will add weight to my posts.
Prompt 13 — Learning C and References¶
How do I learn C properly for this, like Beej's guide or something.
Use datavorous Reddit to find what he recommends, and what my prereqs for this will be:
https://www.reddit.com/user/Shonku_/
Output a references.md that will guide me to learn prerequisites.
Also timeline is messed up, its June 2 and applications start by Nov.
Prompt 14 — Is LuckFox Really the Best?¶
Prompt 15 — Custom Inference vs NPU Clarity¶
Prompt 16 — What Do Research Labs Actually Use?¶
Prompt 17 — Qualcomm and Specific Labs¶
What about Qualcomm? Find me some research labs for me, tell me what they do,
what hardware they use, and pinpoint a good hardware for it.
Prompt 18 — Rewrite All MDs for ESP32 Only¶
Only ESP32, cheaper and better. Rewrite all my MDs for this plan.
Also add some fun research papers like Attention Is All You Need,
I'll blog them on Twitter etc.
Keep the MDs structured, simple to understand.
prompts.md should basically contain all structured prompts I gave you.
Prompt 19 — Audio Pivot Question¶
Prompt 20 — Life Memory Device (from ChatGPT conversation)¶
[Uploaded ChatGPT memory export]
This was a chat I had with ChatGPT. I wanna make this — a device that keeps
listening to me and makes a graph with memories or something.
Prompt 21 — Form Factor and Sensor Scope¶
I'll just skip the camera then? And add a motion sensor or something,
so I can keep this like a bracelet? Or just audio is fine too?
Prompt 22 — Final Direction¶
It's not really a bracelet, just call it a device.
Write all three MDs properly, with proper interesting trending research
like Attention Is All You Need and some with AGI and privacy like the
big techs keep talking about.
Final Decisions Summary¶
Device : ESP32-S3-N16R8 + INMP441 I2S microphone
No camera, no motion sensor, audio only
~₹1500-1750 total
Project : Ambient Memory Device
Continuously captures audio events
Custom INT8 inference runtime in C (no ML frameworks)
Tiny CNN: log-mel spectrogram → event type + 32-dim embedding
Memory packets synced to local Python backend
Knowledge graph + cosine retrieval + optional local LLM
Research space : TinyML + episodic memory + privacy-preserving edge AI
Intersection of MCUNet-style systems work and
MemGPT/Generative Agents-style memory research
Research angle : "What is the minimum audio representation that preserves
sufficient semantic information for episodic memory
retrieval under severe hardware constraint?"
Target labs : IIIT-H CVIT + Qualcomm Edge AI Lab (primary)
IISc DESE, IIT Madras ESB (secondary)
Timeline : June → November 2026 applications
Blog Post 1: end of August (runtime + quantisation)
Blog Post 2: end of September (full system)
Cold mailing: September onwards
Form factor : Desktop device (not wearable)
Breadboard prototype first, packaging later
Key People¶
Sagnik Bhattacharya (datavorous) https://datavorous.github.io/ IIIT-H. Qualcomm NPU SDK work. Built spheni (vector search in C), 2KB chess engine. Pattern: well-known problem + severe constraint + from scratch + good writeup = reach.
Mukesh (0xmukesh) https://0xmukesh.github.io/ Generalist across unusual intersections. Biotech, crypto, computational biology. Credibility from shipping things real ecosystems used.
The Narrative Through All Decisions¶
The project evolved through this conversation:
UAV with ESP32 TX/RX
→ too mechanical, not ECE/CS enough
Table viewer with LuckFox
→ interesting but obscure hardware, no NPU story without LuckFox
Table viewer on ESP32 only
→ globally recognised hardware, stronger constraint story
ESP32 + audio (Shazam)
→ better community recognition, but generic if just classification
Ambient memory device (audio only, ESP32-S3)
→ unique application framing, directly aligned with frontier AI research
→ privacy angle is genuine and timely
→ research question is real and open
→ demo is visceral and explainable to anyone
Last updated: June 2026