Instructions for Sharing Data With the Bioinformatics Core on the MSU HPCC

These step-by-step instructions are designed to help users new to Linux and High-Performance Computing (HPC) share and transfer data with the Bioinformatics Core using the Michigan State University High-Performance Computing Center (MSU HPCC).

Prerequisites

Step-by-Step Guide

  1. Access the MSU HPCC through OnDemand
    • Note: If you prefer to access the HPCC via ssh connection, skip to section 4.
    • Open your web browser (i.e. Chrome, Firefox, Safari, etc…).
    • Go to https://ondemand.hpcc.msu.edu/.
  2. Log In with Your MSU Credentials
    • Under Selected Identity Provider ensure that Michigan State University is selected and click the Log On button.
    • Sign in with your MSU NetID and password.
  3. Access the Shell Terminal
    • Click the “>_Development Nodes” dropdown tab on the top of the screen.
    • Click any of the listed development nodes to enter the Terminal window.
  4. Copying Data Out of the Shared Directory (Folder)
    • In the Terminal window, navigate to the shared directory by running (type and hit the enter key on your keyboard):
    cd /path/to/shared/directory
    
    • Replace “/path/to/shared/directory” with the path given to you by your consultant.
    • You may view a list of files within the shared directory by running:
    ls -lah
    
    • Copy files out of the shared directory to your HPCC account space by running:
    cp file_you_want_to_copy.txt /path/to/your/HPCC/account/
    
    • Important Note: To copy directories (folders), add -r after cp in the above command.
    • Replace “file_you_want_to_copy.txt” with the name of the actual file or directory you want to copy.
    • Replace “/path/to/your/HPCC/account/” with the path where you want to copy the file or directory (i.e. /mnt/home/yourUsername/NewDirectoryName/).
      • Tip For better organization, you can create a new directory to copy the shared data into. You can do this by running cd ~/ to navigate to your home directory, then run mkdir NewDirectoryName.
    • Inform your consultant when the transfer is complete so they can close access permissions.
  5. Copying Data Into the Shared Directory
    • Locate the path to your data:
      • If your data is not already on the HPCC, you will need to upload your data.
        • If your files are less than 200MB, the easiest way to do this is through OnDemand.
          • Log on to OnDemand and click the Files drop down tab on the top of the screen, then click Home Directory. There will be a blue Upload button near the top right, click it, then add the files and folders that you want to upload to the HPCC, and click the green Upload Files button. The path to your data will then be /mnt/home/yourUsername/ where yourUsername is your MSU NetID.
          • Note: For organization purposes, you may also make a new directory to upload the files into. Just make sure you know the path to your data on the HPCC (i.e. /mnt/home/yourUsername/DataToShare).
        • If your files are greater than 200MB, it will be best to use one of the options in this guide.
      • If your data is already on the HPCC, find the path to your data by using either OnDemand or the ls -lah command in the Terminal window.
    • In the terminal window type and enter this command:
    cd /path/to/your/data
    
    • Replace “/path/to/your/data” with the actual path found by following the above step.
    • Copy your data from your HPCC account space to the shared directory location by running:
    cp file_you_want_to_copy.txt /path/to/shared/directory
    
    • Important Note: To copy directories (folders), add -r after cp in the above command.
    • Replace “file_you_want_to_copy.txt” with the name of the actual file or directory you want to copy.
    • Replace “/path/to/shared/directory” with the path given to you by your consultant.
    • Inform your consultant when the transfer is complete so they can close access permissions.

Additional Information

Troubleshooting

Getting Help

Summary

By following these steps, you should be able to transfer data to and from the bioinformatics core shared directory on the MSU HPCC efficiently. If you encounter any issues or have questions, don’t hesitate to reach out to the support resources listed above.

October 02, 2024   Leah Terrian, Nicholas Panchy, John Vusich