Asha: Your AI Health Companion 🤖💙

Welcome to Asha, an innovative AI-powered health assistant designed to revolutionize personal healthcare management. Asha combines cutting-edge natural language processing with advanced speech recognition to offer a compassionate, intelligent, and interactive experience.

🌟 Key Features

  • Natural Conversations: Engage in human-like dialogues for health advice and emotional support.
  • Voice-Activated Assistance: Hands-free interaction through advanced speech recognition.
  • Lifelike Responses: Hear Asha's advice with natural-sounding text-to-speech technology.
  • Email Management: Stay on top of your health-related correspondence effortlessly.
  • Smart Scheduling: Book appointments using simple voice commands.
  • Seamless Integrations: Connect with Google Calendar and Gmail for a unified experience.
  • Eye-Friendly Interface: Toggle dark mode for comfortable viewing at any time of day.
  • Conversation Tracking: Review your chat history for consistent care.
  • Device Flexibility: Access Asha on various devices with our responsive design.

🛠️ Technology Stack

  • Frontend: React.js with TypeScript
  • Framework: Next.js for optimal performance
  • Voice Interaction: Web Speech API
  • AI Core: Llama 3.1 language model
  • Voice Synthesis: Piper and WaveNet for natural speech

📋 Prerequisites

Before embarking on your Asha journey, ensure you have:

  • Node.js (v14 or later)
  • npm (v6 or later)
  • Google Cloud Platform account with active Gmail and Calendar APIs
  • OAuth 2.0 credentials for Google API integration

🚀 Getting Started

Let's bring Asha to life on your local machine:

  1. Clone the Repository

    git clone https://github.com/your-username/asha-health-assistant.git
    cd asha-health-assistant
  2. Install Dependencies

    npm install
  3. Configure Environment: Create a .env.local file in the root directory:

    NEXT_PUBLIC_API_URL=your_api_url_here
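
How this variable is consumed is app-specific; as a minimal sketch, assuming a hypothetical /chat endpoint that returns { reply: string }, a client helper might look like this:

    // lib/api.ts — minimal sketch; the /chat endpoint and response shape are assumptions
    const API_URL = process.env.NEXT_PUBLIC_API_URL;

    export async function askAsha(message: string): Promise<string> {
      const res = await fetch(`${API_URL}/chat`, {
        method: "POST",
        headers: { "Content-Type": "application/json" },
        body: JSON.stringify({ message }),
      });
      if (!res.ok) throw new Error(`Asha API error: ${res.status}`);
      const data = await res.json();
      return data.reply; // assumes the API returns { reply: string }
    }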
    

🎙️ Setting Up Piper for Voice Synthesis

  1. Clone and Build Piper

    git clone https://github.com/rhasspy/piper.git
    cd piper
    mkdir build && cd build
    cmake ..
    make
  2. Download Voice Model (from the piper root, since the environment variables below expect the model under piper/models/)

    cd ..
    mkdir -p models
    curl -L -o models/en_US-libritts-high.onnx https://huggingface.co/rhasspy/piper-voices/resolve/v1.0.0/en/en_US/libritts/high/en_US-libritts-high.onnx
    curl -L -o models/en_US-libritts-high.onnx.json https://huggingface.co/rhasspy/piper-voices/resolve/v1.0.0/en/en_US/libritts/high/en_US-libritts-high.onnx.json
  3. Install espeak-ng (Piper depends on it for phonemization)

    # macOS
    brew install espeak-ng
    # Debian/Ubuntu
    sudo apt-get install espeak-ng
  4. Configure Piper Environment: Add to your .env.local:

    PIPER_PATH=/path/to/your/project/piper/build/piper
    PIPER_MODEL_PATH=/path/to/your/project/piper/models/en_US-libritts-high.onnx
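
With those two variables set, the app can shell out to the Piper binary, which reads text on stdin and writes a WAV file to --output_file. The snippet below is an illustrative sketch of a Next.js API route; the /api/tts path and request shape are assumptions, not necessarily the project's actual route:

    // pages/api/tts.ts — illustrative sketch; route path and request shape are assumptions
    import { spawn } from "child_process";
    import { readFile, unlink } from "fs/promises";
    import type { NextApiRequest, NextApiResponse } from "next";

    export default async function handler(req: NextApiRequest, res: NextApiResponse) {
      const text: string = req.body.text ?? "";
      const outFile = `/tmp/asha-${Date.now()}.wav`;

      // Piper reads text from stdin and writes a WAV file to --output_file.
      const piper = spawn(process.env.PIPER_PATH!, [
        "--model", process.env.PIPER_MODEL_PATH!,
        "--output_file", outFile,
      ]);
      piper.stdin.write(text);
      piper.stdin.end();

      await new Promise<void>((resolve, reject) => {
        piper.on("close", (code) =>
          code === 0 ? resolve() : reject(new Error(`piper exited with code ${code}`))
        );
      });

      const audio = await readFile(outFile);
      await unlink(outFile);
      res.setHeader("Content-Type", "audio/wav");
      res.send(audio);
    }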
    

🔗 Integrating Google Services

  1. Set Up Google OAuth 2.0

    • Navigate to the Google Cloud Console
    • Create or select a project
    • Enable Gmail and Google Calendar APIs
    • Generate OAuth 2.0 credentials for web application
    • Configure authorized origins and redirect URIs
  2. Configure API Scopes: Ensure your OAuth consent screen includes:

    • https://www.googleapis.com/auth/gmail.readonly
    • https://www.googleapis.com/auth/calendar.events
  3. Add Google Credentials: Update .env.local with:

    NEXT_PUBLIC_GOOGLE_CLIENT_ID=your_google_client_id
    GOOGLE_CLIENT_SECRET=your_google_client_secret
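
As a sketch of what these credentials and the gmail.readonly scope enable, here is a hedged example using the official googleapis Node client to list unread messages. The token exchange is omitted, and the redirect URI shown is an assumption that must match one you authorized in the console:

    // Sketch: list unread Gmail messages with the googleapis client.
    // Assumes oauth2Client already holds tokens from a completed OAuth flow.
    import { google } from "googleapis";

    const oauth2Client = new google.auth.OAuth2(
      process.env.NEXT_PUBLIC_GOOGLE_CLIENT_ID,
      process.env.GOOGLE_CLIENT_SECRET,
      "http://localhost:3000/api/auth/callback" // assumption: must match an authorized redirect URI
    );

    export async function listUnreadEmails() {
      const gmail = google.gmail({ version: "v1", auth: oauth2Client });
      const { data } = await gmail.users.messages.list({
        userId: "me",
        q: "is:unread",
        maxResults: 10,
      });
      return data.messages ?? [];
    }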
    

🗣️ Conversing with Asha

  1. Wake Asha: Say "Hey Asha" or "Hello" to begin (see the wake-word sketch after this list).

  2. Email Management:

    • "Read my recent emails"
    • "Any unread messages?"
    • "Check for important emails"
  3. Appointment Scheduling:

    • "Book a doctor's appointment for tomorrow at 2 PM"
    • "Schedule a health checkup for next Monday morning"

🧠 Under the Hood: Multi-threading

Asha operates on two primary threads for optimal performance:

  1. Text Processing Thread: Manages AI responses, user inputs, and conversation history.
  2. Audio Processing Thread: Handles speech recognition, text-to-speech conversion, and audio playback.

This parallel processing ensures smooth interactions and swift response times.
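
In a browser/Next.js stack, these "threads" would typically be realized with a Web Worker rather than OS threads. The sketch below (file names and the /api/tts route are assumptions) shows the split: the main thread handles text and conversation state, while the worker fetches synthesized audio without blocking the UI:

    // main thread (text processing): forward each AI reply to the audio worker
    const audioWorker = new Worker(new URL("./audioWorker.ts", import.meta.url));

    audioWorker.onmessage = (event: MessageEvent) => {
      // playback happens here, since AudioContext is unavailable inside workers
      const ctx = new AudioContext();
      ctx.decodeAudioData(event.data.buffer).then((decoded) => {
        const source = ctx.createBufferSource();
        source.buffer = decoded;
        source.connect(ctx.destination);
        source.start();
      });
    };

    function onAssistantReply(text: string) {
      audioWorker.postMessage({ type: "speak", text });
    }

    // audioWorker.ts (audio processing): fetch WAV audio off the main thread
    self.onmessage = async (event: MessageEvent) => {
      if (event.data.type === "speak") {
        const res = await fetch("/api/tts", {
          method: "POST",
          headers: { "Content-Type": "application/json" },
          body: JSON.stringify({ text: event.data.text }),
        });
        const buffer = await res.arrayBuffer();
        (self as any).postMessage({ buffer }, [buffer]); // transfer, don't copy
      }
    };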

🎓 Training Custom Voice Models

Enhance Asha's voice with personalized models:

  1. Data Collection: Utilize audio_download_create_wav_files.py for audio processing.
  2. Data Cleaning: Run process_wav_files_to_remove_wav_errors.py for WAV file validation.
  3. Transcription: Employ transcript.py with the Whisper model for accurate transcriptions.
  4. Model Training: Use processed audio and transcripts to train your custom voice model.

For an in-depth guide, refer to our Colab training notebook.

💻 Development

Launch Asha in development mode (the app will be served at http://localhost:3000 by default):

npm run dev

🤝 Contributing

We welcome contributions to make Asha even better:

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request