Basic Setup for Big Data

Download, install, update, and manage basic Big Data tools on Windows.

Read about Windows Software Automation

Install software on Windows by typing simple commands with Chocolatey, a software automation tool for Windows.

Install Chocolatey

Install Chocolatey, the Windows package manager, from https://chocolatey.org/ by following the directions on the website.

Install Common Big Data Tools using Chocolatey

No need to reinstall programs you already have. Doing so can add them to your path multiple times.

If your path has multiple Java entries, Java can simply fail to run. Monitor your path variables and know where your software is installed.
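One way to monitor the path is a small script that flags repeated entries and entries that look Java-related. The following is a minimal sketch, not part of the original setup: the function names are hypothetical, and matching on "java" or "jdk" is only a heuristic that assumes the install folders carry those words.

```python
import os
from collections import Counter


def split_path(path_value, sep=os.pathsep):
    """Split a PATH string into non-empty entries with whitespace stripped."""
    return [entry.strip() for entry in path_value.split(sep) if entry.strip()]


def normalize(entry):
    # Windows paths are case-insensitive and a trailing slash is not significant.
    return entry.rstrip("\\/").lower()


def find_duplicates(entries):
    """Return the normalized entries that appear more than once."""
    counts = Counter(normalize(e) for e in entries)
    return [e for e, n in counts.items() if n > 1]


def find_java_entries(entries):
    """Return entries whose text mentions java or jdk (a heuristic, see above)."""
    return [e for e in entries if "java" in e.lower() or "jdk" in e.lower()]


if __name__ == "__main__":
    entries = split_path(os.environ.get("PATH", ""))
    print("Duplicate entries:", find_duplicates(entries))
    print("Java-related entries:", find_java_entries(entries))
```

If the script reports more than one Java-related entry, check which one comes first in the path, since that is the one Windows finds first.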

choco install miniconda3 --params="'/AddToPath:1'" -y
choco install openjdk -y
choco install 7zip.install -y
choco install curl -y
choco install git -y
choco install gradle -y
choco install maven -y
choco install notepadplusplus -y
choco install putty -y
choco install tortoisegit -y
choco install vscode -y
choco install wget -y
refreshenv
# Chocolatey 2.x lists only installed packages by default; on 1.x, use: choco list --local-only
choco list

Verify

After installing, even if you run refreshenv, it is a good idea to close that PowerShell window and open a new one. (This is especially needed to complete the OpenJDK installation.)

In a new PowerShell window, run:

# On Chocolatey 1.x, use: choco list --local-only
choco list

Inspect the installed software. Chocolatey's default location is ‘C:\ProgramData\chocolatey’.
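The folder can also be inspected from Python. The sketch below assumes that Chocolatey records its install folder in the ChocolateyInstall environment variable and keeps one subfolder per package under lib. Both function names are hypothetical.

```python
import os
from pathlib import Path

DEFAULT_ROOT = r"C:\ProgramData\chocolatey"


def chocolatey_root(environ=None):
    """Return the Chocolatey folder, falling back to the default location."""
    environ = os.environ if environ is None else environ
    return Path(environ.get("ChocolateyInstall", DEFAULT_ROOT))


def list_package_dirs(root):
    """Return the sorted names of the package folders under <root>/lib."""
    lib = Path(root) / "lib"
    if not lib.is_dir():
        return []
    return sorted(p.name for p in lib.iterdir() if p.is_dir())


if __name__ == "__main__":
    root = chocolatey_root()
    print("Chocolatey folder:", root)
    for name in list_package_dirs(root):
        print(" ", name)
```

An empty list means the lib folder was not found, for example because Chocolatey is installed somewhere else.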

Inspect Windows environment variables. Press the Windows key and type env. Select “Edit the system environment variables”. On the Advanced tab of the System Properties window, click “Environment Variables”.

git --version
java --version
python --version

Troubleshooting: If a version command does not work, be sure you have closed your PowerShell window and opened a new one. You may also try restarting your machine.
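Once Python itself works, the same checks can be scripted. The sketch below looks each tool up on the path with shutil.which and prints the first line of its version output. The check_tool helper is a hypothetical name, and the script assumes each tool accepts --version, as the commands above do.

```python
import shutil
import subprocess


def check_tool(name, version_flag="--version"):
    """Return (path, first line of version output), or (None, None) if not found."""
    path = shutil.which(name)
    if path is None:
        return None, None
    result = subprocess.run([path, version_flag], capture_output=True, text=True)
    # Some tools print their version to stderr instead of stdout.
    output = (result.stdout or result.stderr).strip()
    first_line = output.splitlines()[0] if output else ""
    return path, first_line


if __name__ == "__main__":
    for tool in ("git", "java", "python"):
        path, version = check_tool(tool)
        if path is None:
            print(f"{tool}: not found on PATH")
        else:
            print(f"{tool}: {version} ({path})")
```

The printed path shows which installation answered, which helps when more than one copy of a tool is on the path.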

Upgrade Periodically

choco upgrade chocolatey -y
choco upgrade all -y
refreshenv

Install Without Chocolatey

Alternatively, each tool can be installed in the traditional manner. Just go to the website for the software and follow instructions to download, install, and configure tools using provided installers.

Activating Python Environment

You may receive a warning message if you have not activated your Conda environment:

Warning: 
This Python interpreter is in a conda environment, but the environment has
not been activated. Libraries may fail to load. To activate this environment
please see https://conda.io/activation.

If you receive this warning, you need to activate your environment. On Windows, run the following in Anaconda Prompt:

c:\Anaconda3\Scripts\activate base

If you installed Miniconda with Chocolatey, run this instead:

C:\tools\miniconda3\Scripts\activate base

Know where your software is installed, and review your User and System environment variables, especially the path.
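To check whether activation took effect, you can ask Python directly. This is a minimal sketch that assumes conda sets the CONDA_DEFAULT_ENV environment variable on activation and that a conda installation contains a conda-meta folder. Both function names are hypothetical.

```python
import os
import sys


def active_conda_env(environ=None):
    """Return the name of the activated conda environment, or None."""
    environ = os.environ if environ is None else environ
    return environ.get("CONDA_DEFAULT_ENV")


def is_conda_install(prefix=sys.prefix):
    """Return True if the given Python prefix looks like a conda environment."""
    return os.path.isdir(os.path.join(prefix, "conda-meta"))


if __name__ == "__main__":
    env = active_conda_env()
    if env:
        print(f"Active conda environment: {env}")
    elif is_conda_install():
        print("This Python is in a conda environment that is not activated.")
    else:
        print("This Python does not appear to come from conda.")
```

Run it with the same python command that produced the warning, so that it inspects the same interpreter.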

Terms