/tag/linux

  • Ivy Secure Environment

    Ivy is a secure computing environment for researchers consisting of virtual machines (Linux and Windows). Researchers can use Ivy to process and store sensitive data with the confidence that the environment is secure and meets HIPAA, FERPA, CUI or ITAR requirements.
    Facilities Statement - Are you submitting a grant proposal and need standard information about UVA research computing environments? Get it here. Overview Ivy consists of both virtual computing environments and secure storage. In order to obtain access to either system, users must
    Submit an account request, Complete the Information Security Awareness Training, and Ensure their personal computer meets all High Security VPN requirements.

  • ACCESS: Advanced Cyberinfrastructure Coordination Ecosystem: Services and Support

    The NSF’s ACCESS (Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support) program builds upon the successes of the 11-year XSEDE project, while also expanding the ecosystem with capabilities for new modes of research and further democratizing participation. ACCESS Home:  access-ci.org  access-ci.org/about Allocations Allocations: allocations.access-ci.org Documentation Support: support.access-ci.org Community Engagement ACCESS: support.access-ci.org/affinity-groups Campus Champions: https://campuschampions.cyberinfrastructure.org UVa Research Computing has two Champions, Ed Hall and Katherine Holcomb For more help, please feel free to contact RC staff to set up a consultation or visit us during office hours.
  • XSEDE: Extreme Science and Engineering Development Environment

    XSEDE’s Mission was to substantially enhance the productivity of a growing community of scholars, researchers, and engineers through access to advanced digital services that support open research; and coordinate and add significant value to the leading cyberinfrastructure resources funded by the NSF and other agencies. — The XSEDE project ended on August 31, 2022 and was succeeded by the  ACCESS project.
    XSEDE Home:  www.xsede.org

  • Software on Ivy Linux Virtual Machines

    Each Linux Virtual Machine (VM) comes with a set of preinstalled software applications. Each VM can further be customized via installation of optional software packages.
    An overview of available software packages for Windows VMs is provided here.
    Preinstalled Software Software Versions Description Anaconda2023.07-py3.11 The open-source Anaconda Distribution is the easiest way to perform Python/R data science and machine learning on Linux, Windows, and Mac OS X. R4.3.1 R is a free software environment for statistical computing and graphics. RStudio Server2023.06.2 RStudio Server enables you to provide a browser based interface to a version of R running on a remote Linux server.

  • Bioinformatics Packages on Ivy Linux VM

    Available Packages The following bioinformatics packages are available on the Ivy Linux Virtual Machines
    Bowtie2 Bowtie2 is a memory-efficient tool for aligning short sequences to long reference genomes.
    For bowtie2 usage information, please click [here] (/userinfo/ivy/ivy-linux-sw/bioinformatics/bowtie2)
    HISAT2 HISAT2 is a fast and sensitive tool for aligning short reads against the general human population
    (as well as single reference genome)

    • Requires approval before installation
      For HISAT2 usage information, please click here

  • Bowtie2 on Ivy Linux VM

    Bowtie2 is a memory-efficient tool for aligning short sequences to long reference genomes.
    It indexes the genome using FM Index, which is based on Burrows-Wheeler Transform algorithm,
    to keep its memory footprint small. Bowtie2 supports gapped, local and paired-end alignment modes.
    Alignment to a known reference using Bowtie2 is often an essential first step in a myriad of NGS analyses workflows.
    Bowtie2 Usage Alignment using bowtie2 is a 2-step process - indexing the reference genome, followed by aligning the sequence data.
    Create indexes of your reference genome of interest stored in reference.fasta file:
    bowtie2-build [option(s)] <reference.fasta> <bt2-index-basename> This will create new files with the provided basename and extensions .

  • Data Analysis Packages on Ivy Linux VM

    Available Packages The following Data Analysis packages are available on the Ivy Linux Virtual Machines
    MATLAB MATrix LABoratory (MATLAB for short) is a software designed for quick scientific calculations, such as matrix manipulation, plotting, and others.
    It has hundreds of built-in functions for a wide variety of computations and several tools designed for specific
    research disciplines, including statistics and partial differential equations.

    • Limited licenses available, for more information on MATLAB and licensing, please click [here] (/userinfo/ivy/ivy-linux-sw/data-analysis/matlab)
      SAS SAS is large platform independent software with multiple components, and is used for statistical analysis, data ETL operations, as well as several other

  • Database Software on Ivy Linux VM

    The following database software are available on the Ivy Linux Virtual Machines
    MySQL Is the most popular open-source relational database, used in academia and industry worldwide. It has been
    in use for over 20 years and is backed by a large developer community. It is available in both free and
    proprietary versions.
    MariaDB MariaDB is a community developed version of MySQL, and is highly compatible with MySQL and other relational databases. Existing databases can be easily migrated between MySQL and MariaDB, and vice versa.
    PostgreSQL Unlike MariaDB and MySQL, PostgreSQL is an object relational database, and can be used in a manner similar to other relational databases.

  • HISAT2 on Ivy Linux VM

    • Please note that HISAT2 requires approval prior to installation on the VM
      HISAT2 is a fast and sensitive tool for aligning short reads against the general human population
      (as well as single reference genome). It indexes the genome using a Hierarchical Graph FM Index
      (HGFM) strategy, i.e. a large set of small indexes that collectively cover the whole genome
      (each index representing a genomic region of 56 Kbp).
      HISAT2 Usage: Alignment using HISAT2 is a 2-step process - indexing the reference genome, followed by aligning the sequence data.
      Create indexes of your reference genome of interest stored in reference.

  • cTAKES on Ivy Linux VMs

    cTAKES Overview cTAKES or The clinical Text Analysis and Knowledge Extraction System, is a Mayo Clinic developed Natural Language Processing (NLP) tool used to
    extract information out of clinical records. It is open-source, and built on the Apache Unstructured Information Management Architecture. cTAKES
    is modular, expandable, for a number of generic use cases, and contains excellent best practice notes.
    cTAKES Usage cTAKES components Some of cTAKES components are listed below:
    Sentence boundary detection (OpenNLP technology)
    *Tokenization (rule-based) Morphologic normalization (NLM’s LVG) POS tagging (OpenNLP technology) Shallow parsing (OpenNLP technology) Named Entity Recognition Negation and context identification (both based on NegEx) cTAKES Named Entities cTAKES contains the following Named Entities:

  • User Licensed Stata on Ivy Linux VMs

    Stata Overview Stata is a graphical data analysis tool developed by StataCorp, and is short for Statistics and Data. It
    is used in various disciplines, including biomedicine, economics, epidemiology, among others. It is capable
    of performing statistical analysis, simulations, regression, and data management. Besides the standard version
    Stata also ships with the MP version (multi=processing), and SE for large databases.
    Users requesting an installation of Stata are required to provide their own license. Please consult with us before
    requesting an installation.
    You may also request a Stata license from the UVa Software Gateway
    Installing programs from SSC Please first run the following commands to use the proxy:

  • UVa Licensed IDL on Ivy Linux VMs

    IDL Overview IDL, short for Interactive Data Language, is an interactive shell based data analysis programming language. Used vastly in medical imaging, it can quickly create visualizations and graphs
    of large data sets in a few easy steps due to its vector nature. FORTRAN users would be familiar with the IDL syntax. IDL is not to be confused with
    Java IDL or Microsoft IDL.
    Basic IDL Usage To start IDL, open a terminal window and type idl. This will start the interactive shell.
    Variables in IDL To initialize variables in IDL, type:
    <variable_name> = <variable_value> e.g.
    x = 3 and hit Enter/Return

  • UVa Licensed MATLAB on Ivy Linux Virtual Machines

    MATLAB Overview MATLAB is a high-performance language for technical computing. It integrates
    computation, visualization, and programming environment. MATLAB stands for MATrix LABoratory. MATLAB was made
    to provide easy access to matrix software developed by the LINPACK (linear system package)
    and EISPACK (Eigen system package) projects. MATLAB includes a programming language
    environment with built-in editing and debugging tools, and supports object-oriented programming.
    Programming in MATLAB MATLAB has many advantages compared to conventional computer languages (e.g.,
    C, FORTRAN) for solving technical problems. MATLAB is an interactive system whose
    basic data element is an array, and almost all problems can be solved in MATLAB using that

  • UVa Licensed SAS on the Ivy Linux VM

    SAS Overview SAS is a command-driven software package used for statistical analysis
    and data visualization. It is available in .
    It is one of the most widely used statistical software packages in both industry and academia.
    You may use it if you have a large number of statistical algorithms. It is not limited to an industry,
    and could be used in both scientific and non-scientific contexts. We only offer the Teaching & Research version
    at the moment.
    Common Usage For this example we will use a common scenario from SAS Clinical Standards Toolkit, which is used for supporting clinical
    research activities.

  • Image Processing Software on Ivy Linux VM

    Pre-approved packages The following software packages are pre-approved for image processing on an Ivy Linux VM
    KNIME KNIME is open source analytics platform for data mining and pipelining.
    KNIME’s Image Processing Plugin allows users to perform common image processing
    techniques such as registration, segmentation, and feature extraction. KNIME is compatible with over 120 image file types and can be
    used alongside ImageJ.
    ImageJ ImageJ is a Java-based image processing program developed at the NIH.
    ImageJ can be used interactively through a graphical user interface or automatically with Java.
    OpenCV OpenCV is an open source library for computer vision applications.
    OpenCV includes modules for image processing, video analysis, machine learning, and much more.

  • LibreOffice On Ivy Linux VM

    LibreOffice Overview Our Linux VMs come prepackaged with the open source alternative to Microsoft Office(R), called LibreOffice.
    As of last writing, version 5 is installed, including the specific software suites mentioned below.
    LibreOffice is compatible with all Microsoft Office formats, and can be connected to services like
    Google Drive or DropBox. It is available under the Mozilla Public License. LibreOffice is full GUI
    software and would require you to RDP into your VM or use a graphical tool such as FastX in order to
    render it.
    LibreOffice Writer LibreOffice Writer is the word processor component of LibreOffice. It can save documents in .

  • Preinstalled Java SDK on Ivy Linux VM

    Java SDK Overview Ivy Linux VMs are installed with Java SDK 1.8. Java is a popular Object-Oriented programming
    language and is used in a multitude of scenarios. It is available under the GNU General Public
    License for all users. The SDK consists of a large number of tools such as javac that
    help in application development.
    Running Java commands from the Command Line Open a Command Line Terminal and enter java followed by the desired command. E.g. to find
    the version of the SDK
    java -version Running your code To compile java code, first cd to the location of your .

  • Preinstalled Perl on Ivy Linux VMs

    Perl Our VMs have Perl 5.16.3 available as part of the base linux
    VM. Licensed as open source under the GPL, it is most often used
    to develop mission critical software, and has excellent integration
    with markup languages such as HTML, XML, amongst others. Since it is both Object-Oriented and procedural, it could be used within a multitude
    of programming projects. It includes built in database integration via
    its DBI module. Other than DBI, it has thousands of modules, making it
    one of the most extensible languages. Due to its interpreted nature,
    Perl is similar to Python and would be easy to understand for those

  • Preinstalled Rodeo on Ivy Linux VM

    Rodeo Overview Our Linux VMs are installed with Rodeo version 2.5, as of the last update. Rodeo is a lightweight, Python based, IDE for data science.
    It has a very streamlined code-to-plot workflow, with easily extensible packages that make it simple to
    analyze difficult patterns in data. It includes many data analysis features under one roof, and adopts features from
    iPython Notebook (it actually runs atop the iPython kernel). Like most Python projects,
    it is open source and available for free.
    Launching Rodeo You can launch Rodeo from the Applications menu. It is a self-contained IDE that would not require any knowledge of the command line.

  • Preinstalled Python 2 and Python 3 with Anaconda on Ivy Linux VM

    Anaconda Our VMs have Python 2 and 3 available as part of the Anaconda distribution. Please refer to this page for more information.