HADOOP practice for beginners with illustration
HADOOP practice for beginners with illustration
1. Pre-requisite: Environment setting
2.HDFS system
3. Eclipse:
4. MySql
- 4.1 Download and installation:
- 4.2 Download and install sample database
5. Hive
7. HBase
8. PIG
7. Sqoop
- 7.1 Setup sqoop
- 7.2 Sqoop and Hive
8. Hive Performance Tuning
9. Flume
- 9.1 download
- 9.2 installation
10. Using flume to load twitter data into hadoop
11. Spark and scala
13. IPython
ELK
Appendix 1. Configure network for multiple nodes in hadoop cluster
Appendix 2. How-to install DNS server on hadoop cluster in CentOS7
Appendix 3. How to batch generate ssh key and send to multiple servers
- Method 1 Using Expect
- Method 2: Using Python
Appendix 4: Load data from file system to hdfs
Appendix 6: Configuring Hadoop Security
Appendix 5: Move Data from MySQL to HDFS
Appendix 6: Load data into table in Hive
- non-python:
- Using python
Appendix 7. Move Data (using Sqoop) from MySQL to HIVE
- Finding
Appendix 8. HDFS upgrade instructions
Appendix 9. Tune Hadoop Cluster to get Maximum Performance
Appendix 10 The Hadoop Ecosystem in a nutshell
Appendix 11. Common Linux Knowledge
Visualize near-real-time stock price changes using Solr and Banana UI
Cloudera documentation reference
- Regular-Expression Examples
Project: Bible Statistics
- Loading data
- Step by step:
Project 2: weblog analysis
- Processing & Analytical goals:
- Solution:
Disclaimer

Powered by GitBook

3. Eclipse:

3. Eclipse:

http://eclipse.mirror.rafal.ca/technology/epp/downloads/release/mars/2/eclipse-jee-mars-2-linux-gtk-x86_64.tar.gz

results matching ""

No results matching ""