일 | 월 | 화 | 수 | 목 | 금 | 토 |
---|---|---|---|---|---|---|
1 | 2 | |||||
3 | 4 | 5 | 6 | 7 | 8 | 9 |
10 | 11 | 12 | 13 | 14 | 15 | 16 |
17 | 18 | 19 | 20 | 21 | 22 | 23 |
24 | 25 | 26 | 27 | 28 | 29 | 30 |
- 운영체제
- Regular Expression
- 43. Multiply Strings
- DWG
- 시바견
- Python Code
- Substring with Concatenation of All Words
- Convert Sorted List to Binary Search Tree
- 30. Substring with Concatenation of All Words
- shiba
- Protocol
- Class
- 109. Convert Sorted List to Binary Search Tree
- Python
- concurrency
- 프로그래머스
- 315. Count of Smaller Numbers After Self
- data science
- t1
- Python Implementation
- Generator
- attribute
- 밴픽
- kaggle
- 컴퓨터의 구조
- LeetCode
- Decorator
- 파이썬
- iterator
- 715. Range Module
- Today
- Total
Scribbling
Google Cloud Platform Certificate; Professional Machine Learning Engineer 본문
Google Cloud Platform Certificate; Professional Machine Learning Engineer
focalpoint 2022. 4. 12. 19:22
Architecture for MLOps using TFX, Kubeflow Pipelines, and Cloud Build
- new data: only CT pipeline needed
- new pipeline or new implementation: CI/CD pipeline + CT pipeline needed
Data preprocessing for machine learning: options and recommendations
* Full-pass transformation: use only the training data to compute statistics
De-identification and re-identification of PII in large-scale datasets using Cloud DLP
https://cloud.google.com/architecture/de-identification-re-identification-pii-using-cloud-dlp
Machine Learning Crash Course
https://developers.google.com/machine-learning/crash-course/ml-intro
* Precision & Recall: Trade-off
- if decrease threshold, # TP & # FP increase -> Recall increase = Precision decrease
GCP Products
- best data pipeline, for stream and batch processing, necessary for stream
- SQL syntax, support ML models and importing TF models to the platform
- storage + analysis (fully managed)
- implement functions at this platform (new data -> pub/sub + cloud function)
- handles workflow for kubeflow pipeline
- use it to migrate from on-premises to GCP
- IaaS; Virtual machines
- modeling, training, analyzing, evaluation ... solution for managing ML models
- PaaS; platform that executes your code
- scheduler for task automation
- fully managed database for MYSQL
- fully managed database for large scale & low-latence workloads
- workflow orchestration service built on Apace Airflow
- data ingestion
- integrate data from multiple cloud sources
- Apache Spark & Hadoop clusters
Dumps
https://www.examtopics.com/exams/google/professional-machine-learning-engineer/
tf.distribute.MirroredStrategy supports synchronous distributed training on multiple GPUs on one machine.
tf.distribute.TPUStrategy lets you run your TensorFlow training on Tensor Processing Units (TPUs).
tf.distribute.MultiWorkerMirroredStrategy is very similar to MirroredStrategy. It implements synchronous distributed training across multiple workers, each with potentially multiple GPUs.
tf.distribute.OneDeviceStrategy is a strategy to place all variables and computation on a single specified device.
'Computer Science > Data Science' 카테고리의 다른 글
Data Science 101 (1) | 2022.12.05 |
---|---|
Pandas Operations Repository (0) | 2022.11.20 |
(py)Spark Basics (0) | 2022.10.06 |
데이터 분석 방법의 기초 - Kaggle 타이타닉 예제 (0) | 2021.11.23 |
데이터 분석 방법의 기초 - Kaggle 주택 가격 예측 예제 (0) | 2021.11.23 |