Our paper proposing the Text-Twin-Translation (T3) framework has been accepted to the SIGKDD 2026 AI4Science Track (CORE A*). The same work was a Spotlight Oral at the ICLR 2026 AI4Mat workshop, presented recently in Brazil.
What T3 does
T3 is our answer to the data scarcity that pervades ML for complex nanomaterial/device applications. The workflow:
- Automated prompt optimization via TextGrad. A text-gradient method drives an LLM-based agentic pipeline that extracts structured knowledge graphs from an unstructured publication corpus, cheaply and at high throughput.
- Device-topology-aware Digital Twin. A Graph Neural Network that bakes in device-topology physical constraints, trained as a Digital Twin to predict coupled material–device performance.
- Validation on an OOD downstream task. We apply the framework to designing and screening FET sensor probes for detecting PFAS in water.
We built it so the same pipeline can be pointed at other data-scarce material/device problems.
Links
Thanks to my collaborators and to Schmidt Sciences for supporting this work.