
Career Summary
Ray is a Senior Solutions Architect at Amazon Web Services, specializing in designing and implementing NoSQL, analytics, machine learning, and generative AI solutions in the cloud. A passionate advocate for cloud innovation, he has authored over 10 blog posts on AWS technologies. He has earned all 14 AWS certifications as well as 4 Google Cloud certifications, demonstrating proficiency across multiple cloud platforms.
Work Experience
Solutions Architect
Solutions Architect at Amazon Web Services (AWS)
Achievements:
- AWS Certified - All 12 Certifications
- Google Cloud - 4 Certifications
- Area of Depth: Big Data & Analytics
- Area of Depth: Machine Learning (ML/AI)
- Deploying Apache MXNet on AWS Lambda @aws-samples
- Catalog and analyze Application Load Balancer logs more efficiently with AWS Glue custom classifiers and Amazon Athena @aws-blogs
- Amazon Athena Partition Projection and Glue Partition Indexes Performance Comparison @aws-blogs
- Custom Lens WA hub: DynamoDB, Glue, and OpenSearch @aws-samples (500+ stars)
- Reduce food waste to improve sustainability and financial results in retail with Amazon Forecast @aws-blogs
- Evaluating beyond accuracy with Amazon Personalize by focusing on Diversity, Serendipity, and Novelty @aws-blogs
- Identify objections in customer conversations using Amazon Comprehend to enhance customer experience without ML expertise @aws-blogs
- The Power of Generative AI in Retail Industry: New technology is reshaping fashion retail @aws-community
Technologies used:
- AWS
- Hadoop
- Spark
- Hudi
- SageMaker
- Microservice
Senior Software Engineer
Scala developer and data engineer on the Intelligent Healthcare Data Platform (IHDP). Provide a query engine with fine-grained access control over the data lake, using Apache Ranger as the centralized security framework to manage access control across the EMR cluster and the Hive query engine. Build an onboarding API through which each data source persists its security envelope, which lets the data lake begin ingesting data and supplies the information required by the ETL jobs. Use AWS Glue crawlers to discover and register the schema of raw data, merge data from other data sources, and load it into the data lake.
Achievements
- Design and construct the cross-department cloud for healthcare (IHDP)
- Design and implement the security mechanisms for HIPAA compliance
- Design the onboarding API that implements the data lake's ingestion process
- Design and implement the data pipeline with AWS Glue ETL
- Design and develop the Ranger policy version control service for the query engine
- Design experiments to compare the pros and cons of AWS solutions
Technologies used:
- Scala
- Play
- Spark
- Hadoop
- AWS
- MySQL
Senior Software Engineer
Responsible for the international advertising platform and dashboard, acting as a server-to-server advertisement provider. The platform collects advertisements from advertisers and dispatches them to our channels and SDK. It offers not only brand advertising but also performance-based advertising, in which the advertiser pays only for measurable results. Nginx is deployed as the load balancer, routing requests to available nodes. Kafka serves as a high-throughput message queue that dispatches messages to consumers. Apache Hive served as the query engine behind the query API and reporting service, but its MapReduce jobs took a long time. We therefore implemented a microservice with the Akka actor system and Druid as our time-series database, keeping HDFS as deep storage, to aggregate data by dimension in real time.
Achievements
- Design and develop dashboard and report system with fine-grained authZ control
- Implement a crawler using Akka actors with multiple workers and dispatching
- Implement a three-layer data provider that cooperates with the microservice and event bus
- Implement a transformer and refine the data workflow to improve performance
- Implement the query API with Druid, using HDFS as deep storage
- Implement a message queue that dispatches transaction logs to be consumed by Akka actors
Technologies used:
- Scala
- Play
- Druid
- Kafka
- Microservice
Architect and Lead Developer
Focus on providing consulting services to clients and helping them design and build mobile applications, websites, dashboards, platforms, and API services. To achieve highly concurrent APIs and reduce operating costs, we deployed our services on AWS, which provides on-demand cloud computing. With an architecture based on AWS, we could use load balancing to reduce the load on each node and a message queue service to schedule requests from the entry point or other nodes.
Achievements
- Design and build AWS cloud architecture and sync data flow
- Design and build a database repository that communicates with the data provider
- Design and build a RESTful API that communicates with the database repository
- Design a highly concurrent API with load balancing, auto scaling, and a message queue
- Design and build a mobile application that communicates with the backend API
- Design and build microservices and manage data flow
Technologies used:
- AWS
- PHP
- MySQL
- MongoDB
Skills & Tools
Backend
- AWS
- Scala
- Java
- PHP
- Microservice
Big Data & ML
- Hadoop
- Spark
- Druid
- SageMaker
- BigQuery
Others
- Hive
- Spark ML
- ElasticSearch
- DataOps
- MLOps
- Git
- GCP
- Computer Vision
- Recommendation
Education
- BSc in MIS, National Central University, 2010 - 2014
Certifications
- AWS Certified Cloud Practitioner
- AWS Certified Developer – Associate
- AWS Certified SysOps Administrator – Associate
- AWS Certified Solutions Architect – Associate (SAA-C02)
- AWS Certified DevOps Engineer – Professional
- AWS Certified Solutions Architect – Professional
- AWS Certified Advanced Networking – Specialty
- AWS Certified Security – Specialty
- AWS Certified Machine Learning – Specialty
- AWS Certified Database – Specialty (DBS-C01)
- AWS Certified Big Data – Specialty
- AWS Certified Alexa Skill Builder – Specialty
Language
- Chinese (Native)
- English (Professional)
Interests
- Swimming
- Traveling
- Foodie
- Cooking