AI Apps - Fotios Mpouris

FotiFoti Automated Art Agent

A fully autonomous AI agent that runs an entire art business on autopilot — selecting original line art, colorizing it with AI, writing philosophical captions through a custom persona, and publishing carousel posts to Instagram, Facebook, and a live website. Zero human intervention required.

18-Step Automated Pipeline

Selects & tracks artwork — DynamoDB prevents duplicate posts across the entire library
AI colorization — GPT Image 1.5 transforms line art into 6 distinct styles: Picasso Cubist, Studio Ghibli Watercolor, Retro 70s Psychedelic, Kandinsky, George Condo, and Stuart Davis
Custom AI persona — GPT-4o generates captions through "Artemis," a Jim Morrison-inspired character who writes philosophical, psychedelic commentary unique to each piece
Multi-platform publishing — Instagram carousel + Facebook carousel + live website gallery update — all in a single Lambda execution
Fully serverless — AWS Lambda, S3, DynamoDB, and Secrets Manager; no servers to manage
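
A minimal sketch of the select-and-track step, assuming the simplest possible model: a plain Python set stands in for the DynamoDB table, and the function names and file keys are hypothetical, not taken from the real pipeline.

```python
import random
from typing import Iterable, Optional, Set


def pick_unposted(library: Iterable[str], posted: Set[str],
                  rng: Optional[random.Random] = None) -> Optional[str]:
    """Return one artwork key that has not been posted yet, or None if all have."""
    rng = rng or random.Random()
    # Sort so the candidate order does not depend on the input's iteration order.
    candidates = sorted(k for k in library if k not in posted)
    if not candidates:
        return None
    return rng.choice(candidates)


def mark_posted(key: str, posted: Set[str]) -> bool:
    """Record a key as posted. Returns False if it was already recorded."""
    if key in posted:
        return False
    posted.add(key)
    return True


# Hypothetical library keys, for illustration only.
library = ["lineart/owl.png", "lineart/boat.png", "lineart/mask.png"]
posted = {"lineart/owl.png"}
choice = pick_unposted(library, posted, random.Random(0))
```

With a real DynamoDB table, one common way to get the same guarantee is a conditional write that only succeeds when the item does not exist yet, so two overlapping runs cannot both claim the same piece.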

Dual System Design

Automated scheduler — runs on a set cadence, tracks what's been posted, updates the website gallery automatically via GitHub API
Manual colorizer — lets me experiment with any of the 6 styles on any image without affecting the tracking database — built for testing new looks before adding them to the automated rotation
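
The split between the two systems can be sketched as two entry points that share one colorize step, where only the scheduled path writes to the tracker. Everything here is illustrative: `colorize` is a placeholder that returns an output key instead of calling an image model, and rendering all six styles per scheduled run is an assumption.

```python
from dataclasses import dataclass, field
from typing import List, Set

STYLES = (
    "Picasso Cubist",
    "Studio Ghibli Watercolor",
    "Retro 70s Psychedelic",
    "Kandinsky",
    "George Condo",
    "Stuart Davis",
)


def colorize(image_key: str, style: str) -> str:
    """Placeholder for the image-model call; returns the output key only."""
    if style not in STYLES:
        raise ValueError(f"unknown style: {style}")
    slug = style.lower().replace(" ", "-")
    return f"colorized/{slug}/{image_key}"


@dataclass
class Tracker:
    """In-memory stand-in for the tracking database."""
    posted: Set[str] = field(default_factory=set)


def scheduled_run(image_key: str, tracker: Tracker) -> List[str]:
    """Automated path: render every style, then record the piece as posted."""
    outputs = [colorize(image_key, s) for s in STYLES]
    tracker.posted.add(image_key)
    return outputs


def manual_run(image_key: str, style: str) -> str:
    """Manual path: same colorize step, but it never touches the tracker."""
    return colorize(image_key, style)
```

Keeping the tracker out of `manual_run` entirely, rather than passing a flag, makes it hard for an experiment to mark a piece as posted by accident.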

Tech Stack

AWS Lambda (Python 3.12) · S3 · DynamoDB · Secrets Manager
OpenAI GPT-4o (captions) · GPT Image 1.5 (colorization)
Instagram Graph API · Facebook Graph API · GitHub API
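
For the Instagram side of the publishing step, the Graph API's carousel flow has three stages: create one container per image, create a carousel container that lists them as children, then publish that container. The sketch below follows that flow under stated assumptions: `post` is a hypothetical injected helper that sends an authenticated POST and returns the decoded JSON, and error handling and container status polling are left out.

```python
from typing import Callable, Dict, List

# `post(path, params)` is a hypothetical HTTP helper supplied by the caller.
# It is expected to attach the access token and return the JSON body as a dict.
PostFn = Callable[[str, Dict[str, str]], Dict[str, str]]


def publish_carousel(post: PostFn, ig_user_id: str,
                     image_urls: List[str], caption: str) -> str:
    """Create child containers, a carousel container, then publish it."""
    child_ids = []
    for url in image_urls:
        resp = post(f"/{ig_user_id}/media",
                    {"image_url": url, "is_carousel_item": "true"})
        child_ids.append(resp["id"])

    # The carousel container references its children by comma-separated id.
    container = post(f"/{ig_user_id}/media", {
        "media_type": "CAROUSEL",
        "children": ",".join(child_ids),
        "caption": caption,
    })

    published = post(f"/{ig_user_id}/media_publish",
                     {"creation_id": container["id"]})
    return published["id"]
```

Injecting `post` keeps the sequence testable without network access; a fake helper that records calls is enough to check the order and parameters.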

Hard-E v3.0 — Conversational AI Sales Agent

A voice-first, multi-tenant AI platform built for the contracting industry. Hard-E functions as an intelligent sales orchestrator — managing customer relationships, navigating CRM data in real-time, and holding lifelike conversations with a configurable personality and "soul." Not a chatbot. An AI teammate.

Architecture: Four Core Layers

The Brain — Claude Sonnet orchestrates intent classification, tool selection, and a dynamic "Soul Document" that gives each tenant their own AI personality, tone, and company-specific opinions
The Voice — Cartesia Sonic 3 delivers sub-100ms time-to-first-audio with sentence-level streaming; audio plays while the AI is still thinking. OpenAI TTS runs as a live fallback
The Memory — 3-tier system: session memory (24hr TTL), conversation summaries for long-term relationship context, and a Redis-backed knowledge tier covering company facts, learned patterns, and user preferences
Multi-Tenant Framework — "Tenant-aware from day zero." Each contracting company gets isolated data, memory, and personality. A guided 5-step onboarding questionnaire activates their instance
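
Sentence-level streaming, as described for The Voice, can be approximated by buffering model output and releasing each sentence the moment it is complete, so speech synthesis can start before the full reply exists. This is a rough sketch, not Hard-E's actual splitter: the function name is hypothetical, the TTS call is omitted, and the punctuation rule is deliberately naive (it would also split after abbreviations such as "e.g.").

```python
import re
from typing import Iterable, Iterator

# A sentence ends at ., ! or ? followed by whitespace.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def sentences_from_tokens(tokens: Iterable[str]) -> Iterator[str]:
    """Yield complete sentences as soon as they appear in a token stream."""
    buffer = ""
    for token in tokens:
        buffer += token
        parts = _SENTENCE_END.split(buffer)
        # Everything except the last part is a finished sentence.
        for sentence in parts[:-1]:
            if sentence:
                yield sentence
        buffer = parts[-1]
    # Flush whatever is left once the stream ends.
    if buffer.strip():
        yield buffer.strip()


stream = ["Your roof ", "quote is ready. ", "Want me ", "to send it?"]
spoken = list(sentences_from_tokens(stream))
# spoken == ["Your roof quote is ready.", "Want me to send it?"]
```

Each yielded sentence would be handed to the TTS engine immediately, which is what lets audio start playing while later tokens are still arriving.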

Performance Benchmarks

~40–90ms TTS latency (Cartesia Sonic 3)
~3–4 seconds CRM lookup using Compound Tools (parallel asyncio execution cuts latency ~50% vs sequential calls)
~1.6 seconds simple question response time
Audio-reactive UI — 56-bar frequency visualizer using Web Audio API AnalyserNode responds to the AI's voice in real-time
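
The Compound Tools figure rests on running independent lookups concurrently instead of one after another. A minimal sketch with `asyncio.gather`, assuming two equally slow calls; `fake_lookup` is a stand-in that sleeps rather than hitting a real CRM, and all names are illustrative.

```python
import asyncio
import time


async def fake_lookup(name: str, delay: float) -> str:
    """Stand-in for one CRM call; sleeps instead of doing network I/O."""
    await asyncio.sleep(delay)
    return f"{name}: ok"


async def sequential(delay: float) -> list:
    # Each await finishes before the next call starts.
    return [await fake_lookup("customer", delay),
            await fake_lookup("jobs", delay)]


async def compound(delay: float) -> list:
    # gather() runs both coroutines concurrently and preserves result order.
    return list(await asyncio.gather(fake_lookup("customer", delay),
                                     fake_lookup("jobs", delay)))


def timed(coro) -> tuple:
    """Run a coroutine to completion and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = asyncio.run(coro)
    return result, time.perf_counter() - start


seq_result, seq_time = timed(sequential(0.1))
par_result, par_time = timed(compound(0.1))
```

With two calls of similar duration, the concurrent version takes roughly as long as the slower call alone, which is where a saving of about half comes from. Uneven call durations or more than two calls would change that ratio.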

Safety & Control

Green Tier — internal reads (CRM lookups, memory checks) execute freely
Yellow / Red Tier — write actions (creating customers, updating job stages) require confirmation or explicit readback before execution
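
One way to express the tier gate in code is shown below. The split between the two write tiers is an assumption made for illustration: here Yellow needs a yes/no confirmation and Red needs the user to repeat back an exact summary of the change. The names `Tier` and `run_action` are hypothetical.

```python
from enum import Enum
from typing import Callable, Optional


class Tier(Enum):
    GREEN = "green"    # internal reads
    YELLOW = "yellow"  # writes needing a yes/no confirmation (assumed split)
    RED = "red"        # writes needing an explicit readback (assumed split)


def run_action(tier: Tier, action: Callable[[], str], *,
               confirmed: bool = False,
               readback: Optional[str] = None,
               expected_readback: Optional[str] = None) -> str:
    """Execute `action` only if its tier's gate is satisfied."""
    if tier is Tier.GREEN:
        return action()
    if tier is Tier.YELLOW:
        if not confirmed:
            raise PermissionError("confirmation required")
        return action()
    # RED: the user must repeat back the exact summary of the change.
    if readback is None or readback != expected_readback:
        raise PermissionError("readback required")
    return action()
```

Raising instead of returning a status keeps a blocked write from being mistaken for a successful one further up the call stack.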

On the Roadmap

WebRTC/WebSocket full streaming pipeline targeting sub-1-second end-to-end latency
Hover integration — 3D job site measurement as a native plugin
Dedicated STT microphone input with intelligent silence detection

PPA Mobilization Agent — AI-Driven Campaign Engine

The first live operational agent in the Poor People App's autonomous AI workforce. Powered by GPT-4o with a tightly defined writer's persona — George Orwell's clarity, George Washington's resolve, Odysseus' strategic mind — it generates tactical content grounded in a live knowledge base, then deploys it directly to Telegram and Nostr with one click.

Five Content Formats

Text Posts — Social copy synthesized from research documents; mandatory knowledge base grounding
Sora Scripts — Production-ready video assets: 3 cinematic visual prompts + 20–40 second narration + auto-generated TTS audio (MP3 playable directly in admin panel)
GIF Prompts — Crisis dashboard visual style for animated infographics, data-sourced with citations
Direct Messages — 50–75 word cold outreach; calm, peer-to-peer, zero hype language
Email Outreach — 60–80 word structured emails; bans "revolution," "game-changer," and pitch language by rule
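
The length and wording rules for Direct Messages and Email Outreach lend themselves to a post-generation check. A small sketch, assuming words are counted by whitespace and that only the two terms named above are banned; "pitch language" is not defined in enough detail to test, so it is left out. `check_copy` and its constants are hypothetical names.

```python
import re
from typing import List

BANNED_TERMS = ("revolution", "game-changer")  # the two terms named above
WORD_LIMITS = {"email": (60, 80), "dm": (50, 75)}


def check_copy(text: str, kind: str) -> List[str]:
    """Return a list of rule violations; an empty list means the copy passes."""
    low, high = WORD_LIMITS[kind]
    problems = []
    count = len(text.split())
    if not low <= count <= high:
        problems.append(f"word count {count} outside {low}-{high}")
    if kind == "email":
        for term in BANNED_TERMS:
            # Whole-word, case-insensitive match.
            if re.search(rf"\b{re.escape(term)}\b", text, re.IGNORECASE):
                problems.append(f"banned term: {term}")
    return problems
```

A failed check could trigger a regeneration or simply surface the violations in the admin panel; which of those the agent does is not stated here.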

Self-Improving Memory Architecture

Short-term memory (S3) — Tracks last 5 generations to prevent topic repetition
Long-term learning (DynamoDB) — After every generation the agent scores its own knowledge base usage (0–10) and specificity (0–10), stores the full record, and injects past performance into the next session's system prompt
User feedback loop — Feedback saved to DynamoDB; exemplary outputs flagged for future fine-tuning
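
The first two memory tiers can be modeled in a few lines. This sketch keeps everything in memory, where the real agent uses S3 and DynamoDB, and it assumes the injected context is a short text summary of recent topics and average self-scores. The class and field names are hypothetical.

```python
from collections import deque
from dataclasses import dataclass
from typing import Deque, List


@dataclass
class GenerationRecord:
    topic: str
    kb_usage: int      # self-score, 0-10
    specificity: int   # self-score, 0-10


class AgentMemory:
    def __init__(self, short_term_size: int = 5) -> None:
        # deque(maxlen=...) silently drops the oldest entry when full.
        self.recent_topics: Deque[str] = deque(maxlen=short_term_size)
        self.history: List[GenerationRecord] = []

    def record(self, rec: GenerationRecord) -> None:
        for score in (rec.kb_usage, rec.specificity):
            if not 0 <= score <= 10:
                raise ValueError("scores must be in 0-10")
        self.recent_topics.append(rec.topic)
        self.history.append(rec)

    def prompt_context(self) -> str:
        """Build the text injected into the next system prompt."""
        if not self.history:
            return "No past generations recorded."
        n = len(self.history)
        avg_kb = sum(r.kb_usage for r in self.history) / n
        avg_spec = sum(r.specificity for r in self.history) / n
        return (f"Avoid these recent topics: {', '.join(self.recent_topics)}. "
                f"Past averages: knowledge base usage {avg_kb:.1f}/10, "
                f"specificity {avg_spec:.1f}/10.")
```

The short-term window only affects topic repetition, while the full history feeds the averages, which mirrors the two tiers described above.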

Tech Stack

AWS Lambda (Python) · DynamoDB · S3 · SES
OpenAI GPT-4o · OpenAI TTS-1-HD
Telegram Bot API · Nostr Protocol (3 relays)
Next.js 15 · TypeScript · React 18 · Docker · Nginx

TheGreekClawd — My First Digital Employee

Active Build — Based on OpenClaw Open Source Framework

TheGreekClawd is an autonomous AI agent being built to function as a full digital employee, capable of executing real development work, coordinating tasks through Telegram, and controlling coding agents via Claude Code. Based on the open-source OpenClaw framework, the project targets the frontier of agentic AI: not just an assistant that answers questions, but an agent that ships code, manages workflows, and operates independently between sessions. The Telegram channel is already live, and the rest of the build is well underway.

Telegram-native interface — already operational, coordinating in real-time
Claude Code integration — autonomous coding agent control and execution
OpenClaw framework — built on open-source agentic infrastructure
End-to-end task execution — designed to receive a goal and ship the result

Earlier Builds

RoadScout: Podcast & YouTube Summarizer

Fetches transcripts automatically from any YouTube link
Generates persona-driven AI summaries using GPT-4.1
Stitches narration into a single MP3 for offline listening
Push notifications via ntfy when tasks complete
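
An ntfy notification is a plain HTTP POST to a topic URL, with the message as the request body and optional headers such as `Title`. The sketch below only builds the request so it can be inspected without network access. The topic name and message are made up, and the public `ntfy.sh` server is assumed as the default.

```python
import urllib.request


def build_ntfy_request(topic: str, message: str, title: str,
                       server: str = "https://ntfy.sh") -> urllib.request.Request:
    """Build (but do not send) a publish request for an ntfy topic."""
    return urllib.request.Request(
        url=f"{server}/{topic}",
        data=message.encode("utf-8"),
        headers={"Title": title},
        method="POST",
    )


# Hypothetical topic and message, for illustration only.
req = build_ntfy_request("roadscout-demo", "Summary MP3 is ready", "RoadScout")
# To actually send it: urllib.request.urlopen(req, timeout=10)
```

Anyone who knows a topic name on a public server can subscribe to it, so a hard-to-guess topic or a self-hosted server is the usual choice for anything private.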

Oriana: Article Summarizer & Reader

AI summarization with ElevenLabs Text-to-Speech
Supports URLs, pasted text, and multi-language
MP3 downloads for offline listening
Synthesizes multiple articles into cohesive narratives

Resume Tailor

AI-powered resume analysis and PDF output
V2: Enhanced AI customization and improved UI
Streamlit-powered for rapid iteration and deployment

Oriana: AI Investigative Assistant

AI journalism and research tool
Advanced source management system
Automated transcript generation
