Add Notes, Voice Clone TTS, fix auth persistence and maxTokens

Notes:
- notes table with TEXT/AUDIO types, category support
- Audio upload → OpenRouter Gemini STT → OCI GenAI polish/summary
- Raw STT saved separately in raw_content column
- Polish/summary button for manual re-processing
- Async processing with real-time polling

Voice Clone TTS:
- Qwen3-TTS 1.7B model on A10 GPU via FastAPI server
- Voice profile registration (record/upload → save embedding)
- Profile-based TTS generation API
- TTS web page with recording, profile management, generation

Auth fixes:
- Store both access + refresh tokens in localStorage
- Initialize state from localStorage synchronously (no flash)
- Request interceptor reads token from localStorage every request
- Refresh via body (not just cookie)

Other fixes:
- maxTokens 4096 → 65536 (OCI GenAI Gemini supports up to 65536)
- Fix broken Korean chars in source files
- OpenRouter config for STT
- ffmpeg installed for audio conversion
- Ollama + Gemma 4 E4B installed (STT fallback)
- nginx proxy for TTS server (/api/tts/)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

This commit is contained in:

joungmin

2026-04-13 07:34:18 +00:00

parent 6c2129d42e

commit 1088b23790

14 changed files with 1863 additions and 120 deletions

3

.gitignore vendored

View File

@@ -68,3 +68,6 @@ oracle_data/
 # ========================
 .claude/
 cookies.txt
 audio-uploads/
 voice-profiles/
 *.wav

Add Notes, Voice Clone TTS, fix auth persistence and maxTokens

3 .gitignore vendored Unescape Escape View File

3

.gitignore vendored

View File