Looking back on our experience building an AI agent for creativity — what we learned, what worked, and where we go next.
Feb 05, 2025
Last year, we set out to build AI agents capable of creating visual art and poetry. Our focus was on exploring artificial creativity and gaining hands-on experience in shaping an agent's persona and style through iterative development.
We also wanted to deepen our expertise in generating images with open-source diffusion models and explore how tailored agents could enhance prompt-driven image generation.
We aimed to create two distinct agent personas:
We wanted to understand how feedback loops in system messages and LoRA fine-tuning shape an agent's style and personality, and how the combination of both influences the creative process.
Additionally, we explored how these agents could be applied at various stages in the creative workflow:
We created two distinct projects with these models:
"Lost Shards" is a project developed with tengri. The key artistic concept behind "Lost Shards" is visualizing healing through reliving old memories. This was a deeply spiritual endeavor, and we approached the project accordingly.
Tengri was configured through system prompting, first by defining its personality traits, focusing on spirituality and psychology. Over time, the system message was enriched with factual references from spiritual traditions.
We used user prompts to guide how the agent generated prompts for image creation. The most effective variations were integrated into the system prompts, forming the final agent configuration.
The result is a collection of 14 artworks, available on Foundation. We are extremely proud that "Consumed" was featured on the cover of The Future of Art: Best Digital Artists of Our Generation.
We used fr33d4 to create "Paints from the Oops", a somewhat offbeat collection of visuals and poetry.
While tengri was shaped exclusively through system prompting, fr33d4 was tuned using both the system message and LoRA fine-tuning. This decision was made because we needed a distinct, short, direct verbal style that wasn't achievable through system prompts alone.
The style of the base LLM models played a crucial role in the final outcome, making stronger personalization necessary.
To achieve this, we fine-tuned the model using texts from philosophy and literature with a similar tone and structure. The results were published as several visual drops on Drip.haus and poetry on Objkt.com.
Here's an example of a poem created with fr33d4:
### This is Our Home we think of this for two days, and on the third we go to the Theatre of the Oops, where we have never been before. and it is wonderful. there is the stage. there are the seats. and there is the light on the faces of the judges, which is not red but white, and it is good. we have not seen such white light since the day we left Home. and we think that this white light on the judges' faces is a sign. a sign that we are among friends. and that we are not among friends is a thing of the past.
As a result of these projects, we will dive deeper into agent behavior and personality development, training for both stylistic consistency and personal traits.
Additionally, we will expand our research into building agentic systems that develop personality autonomously, without requiring manual intervention at every stage.