AI and ML services introduction¶
Info
Created 12/2022 - Updated 1/30/24
Amazon started to use AI in product/book recommendations in the 90s. Now it is used everywhere: fulfillment automation, inventory management, voice driven interactions, drones...
The following diagram illustrates the Amazon ML stack:
ML Amazon Services¶
Rekognition¶
For detection inside of image or video. Used for face detection, labeling, celebrity recognition...
Transcribe¶
Speech to Text service, always self trained. It supports both synch and asynch requests. Words can be customized to meet specific business needs, and special vocabulary. Can be enhanced with custom training sets.
Polly¶
Text to speech. It uses speech synthesis markup language (SSML) to help us put emphasize on words, includes breathing sounds, whispering.... Also it is possible to customize the pronunciation of words and do substitution using Pronunciations lexicons.
Support 31 languages, 20+ NTTS.
It also supports asynchronous synthesis to process up to 200k characters. Texts are sent over TLS and Poly does not retain texts processed.
It supports SSML, Speech Markup Language to control speech delivery: pause, paragraphs, rate of speech, phonetic pronunciations, acronyms and abbreviations.
Translate¶
Text translation service that uses advanced machine learning (neural network) technologies to provide high-quality translation on demand. Low latency <150ms and < 80ms for conversation.
5000 characters in synch requests, and support batch processing.
It is trained on 11 domains. Pay per use.
Lex¶
Build conversational bots or chatbots and virtual assistant. Lex includes speech to text, and NLU.
It also includes ML to do automated designer of the conversation by analysing existing conversation transcripts.
Amazon Textract¶
Amazon Textract is a machine learning (ML) service that automatically extracts text, handwriting, and data from scanned documents. It uses Optical Character Recognition.
Amazon Connect¶
Cloud contact center.
Comprehend¶
Uses natural language processing (NLP) to extract insights about the content of documents. Entity extraction, detect language, sentiment analysis...
No need to provide training data, it is always trained.
SageMaker¶
See dedicated note.
Forecast¶
Fully managed service that uses statistical and machine learning algorithms to deliver highly accurate time-series forecasts.
Kendra¶
Highly accurate and intelligent search service that enables users to search unstructured and structured data using natural language processing and advanced search algorithms.
User can define ingestion process for documents and repositories.
Personalize¶
Fully managed machine learning service that uses our data to generate item recommendations for the end users. Recommenders are services to be integrated in application to get real-time recommendation. Optimized for Retail, media and entertainment. It is usind on amazon.com.
Includes recipes to define recommenders.
Amazon Personalize samples Git Repo.
ML domain of knowledge¶
Data Engineering¶
Data repositories¶
Data ingestion solution¶
Data transformation solution¶
Data analysis¶
Sanitize and prepare data for modeling¶
Feature engineering¶
Data analysis for ML¶
Modeling¶
Frame business problems as ML problems¶
Model selection¶
Train model¶
Perform hyperparameter optimization¶
Evaluate model¶
ML implementation and operations¶
Build model for enterprise needs¶
Technology fit for purpose¶
Security specific for ML¶
Deploy and operationalize¶
- Expose endpoints and interact with them.
- Understand ML models.
- Perform A/B testing.
- Retrain pipelines.
- Debug and troubleshoot ML models.