Spark - IT文库_程序员IT互联网编程电子书和文档免费下载，助您码力十足！

首页文库资料文章资讯上传文档发布文章登录账户

TiDB中文技术文档

2. $ cd /opt/spark/data/tispark-sample-data 3. $ mysql -h tidb -P 4000 -u root < dss.ddl 当样本数据加载到 TiDB 集群之后，可以使用 docker-compose exec tispark-master /opt/spark/bin/spark-shell 来访问 Spark shell。 1. $ docker-compose exec tispark-master /opt/spark/bin/spark-shell 2. ... 3. Spark context available as 'sc' (master = local[*], app id = local-1527045927617). 4. Spark session available as 'spark'. 5. Welcome to 6. 16. scala> import org.apache.spark.sql.TiContext 17. ... 18. scala> val ti = new TiContext(spark) 19. ... 20. scala> ti.tidbMapDatabase("TPCH_001") 21. ... 22. scala> spark.sql("select count(*) from lineitem")

0 码力 | 444 页 | 4.89 MB | 6 月前
3
深入 Kubernetes 的无人区-蚂蚁金服双十一的调度系统

XGBoost Spark SparkApplication Spark Operator 在线应用无状态服务 Node Node Node Node Node Node 无状态服务无状态服务无状态服务无状态服务无状态服务无状态服务 TF PS 1 TF Worker 1 TF Worker 2 spark driver 1 spark exec 1 spark exec 212/19

0 码力 | 19 页 | 2.18 MB | 6 月前
3
TiDB v8.5 Documentation

nodes based on the actual status of the system. • TiSpark • If your data needs to be analyzed with Spark, deploy TiSpark. For specific process, see TiSpark User Guide. 142 3.4.4 Data preparation After topology. TiSpark is a component built for running Apache Spark on top of TiDB/TiKV to answer complex OLAP queries. It brings benefits of both the Spark platform and the distributed TiKV cluster to TiDB and deployed in the home directory of the user. 5.3.6.3 Prerequisites TiSpark is based on the Apache Spark cluster, so before you start the TiDB cluster that contains TiSpark, you must ensure that Java Runtime

0 码力 | 6730 页 | 111.36 MB | 10 月前
3
TiDB v8.2 Documentation

nodes based on the actual status of the system. • TiSpark • If your data needs to be analyzed with Spark, deploy TiSpark. For specific process, see TiSpark User Guide. 112 3.4.4 Data preparation After topology. TiSpark is a component built for running Apache Spark on top of TiDB/TiKV to answer complex OLAP queries. It brings benefits of both the Spark platform and the distributed TiKV cluster to TiDB and deployed in the home directory of the user. 5.3.7.3 Prerequisites TiSpark is based on the Apache Spark cluster, so before you start the TiDB cluster that contains TiSpark, you must ensure that Java Runtime

0 码力 | 6549 页 | 108.77 MB | 10 月前
3
TiDB v8.3 Documentation

nodes based on the actual status of the system. • TiSpark • If your data needs to be analyzed with Spark, deploy TiSpark. For specific process, see TiSpark User Guide. 115 3.4.4 Data preparation After topology. TiSpark is a component built for running Apache Spark on top of TiDB/TiKV to answer complex OLAP queries. It brings benefits of both the Spark platform and the distributed TiKV cluster to TiDB and deployed in the home directory of the user. 5.3.7.3 Prerequisites TiSpark is based on the Apache Spark cluster, so before you start the TiDB cluster that contains TiSpark, you must ensure that Java Runtime

0 码力 | 6606 页 | 109.48 MB | 10 月前
3
TiDB v8.4 Documentation

nodes based on the actual status of the system. • TiSpark • If your data needs to be analyzed with Spark, deploy TiSpark. For specific process, see TiSpark User Guide. 137 3.4.4 Data preparation After topology. TiSpark is a component built for running Apache Spark on top of TiDB/TiKV to answer complex OLAP queries. It brings benefits of both the Spark platform and the distributed TiKV cluster to TiDB and deployed in the home directory of the user. 5.3.6.3 Prerequisites TiSpark is based on the Apache Spark cluster, so before you start the TiDB cluster that contains TiSpark, you must ensure that Java Runtime

0 码力 | 6705 页 | 110.86 MB | 10 月前
3
TiDB v8.1 Documentation

nodes based on the actual status of the system. • TiSpark • If your data needs to be analyzed with Spark, deploy TiSpark. For specific process, see TiSpark User Guide. 112 3.4.4 Data preparation After topology. TiSpark is a component built for running Apache Spark on top of TiDB/TiKV to answer complex OLAP queries. It brings benefits of both the Spark platform and the distributed TiKV cluster to TiDB and deployed in the home directory of the user. 5.3.6.3 Prerequisites TiSpark is based on the Apache Spark cluster, so before you start the TiDB cluster that contains TiSpark, you must ensure that Java Runtime

0 码力 | 6479 页 | 108.61 MB | 10 月前
3
TiDB v8.4 中文手册

会成为主要瓶颈，也容易产生读写热点。此时 TiFlash 节点数与 OLAP 计算量有较复杂非线性关系，需要根据具体系统状态调整节点数量。 97 • TiSpark – 如果你的业务需要基于 Spark 进行分析，请部署 TiSpark。具体步骤，请参阅TiSpark 用户指南。 3.4.4 HTAP 数据准备 TiFlash 部署完成后并不会自动同步数据，你需要指定需要同步到 TiFlash TiSpark 部署拓扑本文介绍 TiSpark 部署的拓扑，以及如何在最小拓扑的基础上同时部署 TiSpark。TiSpark 是 PingCAP 为解决用户复杂 OLAP 需求而推出的产品。它借助 Spark 平台，同时融合 TiKV 分布式集群的优势，和 TiDB 一起为用户一站式解决 HTAP (Hybrid Transactional/Analytical Processing) 的需求。关于 tispark_masters: - host: 10.0.1.21 ### NOTE: multiple worker nodes on the same host is not supported by Spark tispark_workers: - host: 10.0.1.22 - host: 10.0.1.23 monitoring_servers: - host: 10.0.1.10 grafana_servers:

0 码力 | 5072 页 | 104.05 MB | 10 月前
3
TiDB v8.2 中文手册

会成为主要瓶颈，也容易产生读写热点。此时 TiFlash 节点数与 OLAP 计算量有较复杂非线性关系，需要根据具体系统状态调整节点数量。 81 • TiSpark – 如果你的业务需要基于 Spark 进行分析，请部署 TiSpark。具体步骤，请参阅TiSpark 用户指南。 3.4.4 HTAP 数据准备 TiFlash 部署完成后并不会自动同步数据，你需要指定需要同步到 TiFlash TiSpark 部署拓扑本文介绍 TiSpark 部署的拓扑，以及如何在最小拓扑的基础上同时部署 TiSpark。TiSpark 是 PingCAP 为解决用户复杂 OLAP 需求而推出的产品。它借助 Spark 平台，同时融合 TiKV 分布式集群的优势，和 TiDB 一起为用户一站式解决 HTAP (Hybrid Transactional/Analytical Processing) 的需求。关于 tispark_masters: - host: 10.0.1.21 ### NOTE: multiple worker nodes on the same host is not supported by Spark tispark_workers: - host: 10.0.1.22 - host: 10.0.1.23 monitoring_servers: - host: 10.0.1.10 grafana_servers:

0 码力 | 4987 页 | 102.91 MB | 10 月前
3
TiDB v8.5 中文手册

I/O 会成为主要瓶颈，也容易产生读写热点。此时 TiFlash 节点数与 OLAP 计算量有较复杂非线性关系，需要根据具体系统状态调整节点数量。 • TiSpark – 如果你的业务需要基于 Spark 进行分析，请部署 TiSpark。具体步骤，请参阅TiSpark 用户指南。 3.4.4 HTAP 数据准备 TiFlash 部署完成后并不会自动同步数据，你需要指定需要同步到 TiFlash TiSpark 部署拓扑本文介绍 TiSpark 部署的拓扑，以及如何在最小拓扑的基础上同时部署 TiSpark。TiSpark 是 PingCAP 为解决用户复杂 OLAP 需求而推出的产品。它借助 Spark 平台，同时融合 TiKV 分布式集群的优势，和 TiDB 一起为用户一站式解决 HTAP (Hybrid Transactional/Analytical Processing) 的需求。关于 tispark_masters: - host: 10.0.1.21 ### NOTE: multiple worker nodes on the same host is not supported by Spark tispark_workers: - host: 10.0.1.22 - host: 10.0.1.23 monitoring_servers: - host: 10.0.1.10 grafana_servers:

0 码力 | 5095 页 | 104.54 MB | 10 月前
3

共 23 条前往

页

TiDB 中文技术文档深入 Kubernetes 无人无人区蚂蚁金服双十调度系统 v8 Documentation 手册

分类

语言

格式

TiDB中文技术文档

深入 Kubernetes 的无人区-蚂蚁金服双十一的调度系统

TiDB v8.5 Documentation

TiDB v8.2 Documentation

TiDB v8.3 Documentation

TiDB v8.4 Documentation

TiDB v8.1 Documentation

TiDB v8.4 中文手册

TiDB v8.2 中文手册

TiDB v8.5 中文手册