Statistical
Lab Software - Overview
Our public labs have a full array of statistical, GIS, office productivity, and multimedia applications that are available to users on any lab machine. In this section, applications are listed by category for Windows machines, with a single section that lists Mac applications.
If there is a particular piece of software that you are interested in using and you do not find it on a lab machine, please [[contact]] or call (617-496-9365).
Overview
Data applications deal with the acquisition and manipulation of data, but do not have the capacity for any statistical analysis. Most of the data applications allow you to convert from one file type to another.
Current data applications available on lab computers (Windows only):
DataFerrett
Lab Version: 1.3.3
DataFerrett is a data mining tool that accesses data stored in TheDataWeb through the internet. DataFerrett allows you to select a databasket full of variables and then recode those variables as you need. You can then develop and customize tables. Selecting your results in your table you can create a chart or graph for a visual presentation into an html page. Save your data in the databasket and save your table for continued reuse.
TheDataWeb is a network of online data libraries that the DataFerrett accesses the data through. Data topics include, census data, economic data, health data, income and unemployment data, population data, labor data, cancer data, crime and transportation data, family dynamics, vital statistics data, etc. As a user, you have an easy access to all these kinds of data. As a participant in TheDataWeb, you can publish your data to TheDataWeb and, in turn, benefit as a provider to the consumer of data.
For further information, please see:
http://dataferrett.census.gov/
This software is available for all lab patrons.
DBMSCopy and Compare
Please Note: DBMS/Copy and Compare is no longer being sold and not supported by any vendor. Use Stat Transfer first for any file conversions. DBMS/Copy and Compare is still useful for older (pre-2000) file type conversions.
Lab Version: v8.0.0A & v7.0.0 (both on all Windows based lab workstations)
DBMS/Copy is a general purpose conversion software. It translates and transfers data between over 70 different data management systems, spreadsheets, statistical software, ODBC, and other application packages. DBMS/Copy is a very useful program for converting data from one format, such as an Excel spreadsheet, to another format such as a Stata ".dta" or SPSS ".sav" file. It has an easy-to-use graphical interface and can convert among dozens of different file types.
DBMS/Compare is that it is a multiple file comparison, data audit and analysis program. The program can scan any directory tree (including an entire disk), grab a list of all the variables in all the data files from multiple file formats, build views showing all variables across all databases and all databases across all variables. From there you can get value lists across multiple databases and do record by record comparisons across any number of databases. All information can be saved to Excel, HTML and ASCII files.
Useful Guides for DBMS/Copy and Compare:
- Converting Excel and other spreadsheet files
- Converting Stata, SPSS, SAS and similar files
- Converting Ascii delimited files
- Converting Ascii fixed format files
- Translating datasets with DBMS/Copy 7
Common file types:
| File type (extension): | DBMS/Copy pseudo-extension: |
| ASCII (dat) (*) | ascii2 |
| Excel 5 (xls) | xls5 |
| Excel 97/2000 (xls) | xls7 |
| Gauss (dat, dht) | gauss |
| Limdep binary (sav) | limsav |
| Limdep ASCII (dat) | limdat |
| Stata v6 8 byte doubles (dta) | stata6 |
| Stata v6 4 byte doubles (dta) | stata64 |
| Stata v7 8 byte doubles (dta) | stata7 |
| Stata v7 4 byte doubles (dta) | stata74 |
| Stata SE 8 byte doubles (dta) | statase |
| Stata SE 4 byte doubles (dta) | statase4 |
| SAS for Windows v6 (sd2) | sd2 |
| SAS for Windows v7 (sd7) | sd7 |
| SAS for Windows v8 (sas7bdat) | sas7bdat |
| SAS for Unix v7/v8 (sas7bdat) | sas7sun |
| SPSS for Windows (sav) | spsswin |
| SPSS for Unix | spsssun |
(*) To read an ASCII file you will need the interactive session (in order to specify the file's structure). If you write a file as ASCII, DBMS will write it as a comma-delimited file with a first row of headers (variable names).
Stat/Transfer
Lab Version: 9.05
Stat/Transfer is designed to simplify the transfer of statistical data between different programs. Stat/Transfer will automatically read statistical data in the internal format of one of the supported programs and will then transfer as much of the information as is present and appropriate to the internal format of another. Stat/Transfer preserves all of the precision in your data, while automatically minimizing the size of your output data set. Stat/Transfer also allows control over the storage format of your output variables. In addition to converting the formats of variables, Stat/Transfer also processes variable names, missing values and value and variable labels automatically. Stat/Transfer allows you to select just the variables and cases you want to transfer. In addition to the standard Windows interface, a command processor on Windows and on Unix allows you to run a transfer in batch mode. This makes it straightforward to set up fully automatic batch procedures for repetitive tasks.
For more information about Stat Transfer:
Useful Guides for Stat/Transfer:
- Stat/Transfer FAQ
- Converting a Stata file with value labels to SAS
- Converting a SAS file with formats to a Stata file
- Converting Files Across Formats With StatTransfer
- Using StatTransfer to Get ASCII Data into Stata
- Moving SPSS or Stata Files into SAS
- Converting SAS Data Files into Other Formats
- File Conversion for PCs and UNIX
Supported Formats
- 1-2-3
- Access (Windows only)
- ASCII - Delimited
- ASCII - Fixed Format
- dBASE and compatible formats
- Excel
- Epi Info
- FoxPro
- Gauss
- HTML Tables (output)
- JMP
- LIMDEP
- Matlab
- Mineset
- NLOGIT
- ODBC (Windows and Mac only)
- OSIRIS (input)
- Paradox
- Quattro Pro
- R
- SAS for Windows and OS/2
- SAS for Unix
- SAS Program with Fixed ASCII Data (output)
- SAS Transport
- SAS Value Labels
- SAS CPORT (input)
- S-PLUS
- SPSS Data
- SPSS Portable
- SPSS Program with Fixed ASCII Data (output)
- Stata
- Stata Program with Fixed ASCII Data (output)
- Statistica (Windows only)
- SYSTAT
Stat/Transfer’s ODBC support allows you to also read and write to such relational databases as Oracle, Sybase, Informix, and DB/2.
TextPipe Pro
Lab Version: 7.8.11
TextPipe Pro is a text processing application that takes a group of files and applies a set of operations or filters to each file in turn. Each filter performs an operation such as a search and replace, adding text to the left margin, converting end of line characters etc, and then passes the result on to the next filter, just as though the entire file had been processed using that filter first. TextPipe Pro has an extensive range of over 100 filters for adding, deleting, replacing, sorting and transforming text. Common tasks like converting files between PC, Macintosh, Mainframe and Unix formats are well catered for. With TextPipe's command-line automation or COM interface, complex processing tasks can be performed without user intervention. Filters included can also split and join files, add line numbers, word wrap, convert between OEM and ANSI and remove duplicate lines/HTML/columns/binary characters/ANSI codes.
Web sites can be maintained easily with multi-file search and replace, and the ability to add standard text (such as copyright messages or banner advertising) to the start and end of each file. Multiple spaces and blank lines can be quickly removed to improve download times.
TextPipe Pro replaces a substantial set of smaller text utilities with a unified and easy-to-use GUI. General users, web authors, administrators and programmers can readily perform complex text-processing with a minimum of technical knowledge.
TextPipe Pro makes it fast and easy to convert, transform and re-purpose data in text files, including
- HTML, XML and other structured documents from the WWW
- Fixed length or delimited files (CSV, Tab, Pipe, etc)
- Unix, Mainframe and PC/Windows end-of-line formats
- Inside Zip files, and the new Microsoft Office 2007 formats DOCX, XLSX, PPTX
- ASCII, ANSI, Unicode and EBCDIC files
- Security log files from firewalls, web servers etc
- EDIFACT, HL7, SWIFT and other structured formats
- Spooled print files
- Structured and unstructured reports of any size or dimension
For more information about TextPipe Pro:
http://www.datamystic.com/textpipe.html
Useful Guides for TextPipe Pro: