Tag Archives: WPS v3.3

Technical Document – Post Installation Steps for WPS Workstations

We just finished updating a document for WPS users, specifically those on Windows platforms, entitled “Configuring Your WPS Workstation after Installing WPS v3.3.” This document helps those who are evaluating the WPS product on Windows learn about and install some features that are specific to WPS.

The document is short, only 14 pages, but touches on modifying the WPS.CFG file as well as installing R and Python to get the greatest amount of utility out of WPS. If you have installed WPS on a Windows workstation and are looking to get additional utility out of your WPS software, this document is for you.

To download the file, use the following link:

http://minequest.com/downloads/Post-Installation-Steps-for-WPS-Workstations.pdf (1008KB)

About the author: Phil Rack is President of MineQuest Business Analytics, LLC located in beautiful Tucson, Arizona. Phil has been a SAS language developer for more than 25 years. MineQuest provides WPS consulting and contract programming services and is an authorized reseller of WPS in North America.

Why WPS v3.3 is important

A few weeks ago, I posted a blog about the new release of WPS v3.3. Quite a few people and companies have been waiting for this release, and I want to talk about why this is such a historic release. I also want to write about what this release means to data scientists.

First, this release is truly a blockbuster. With the Interop for R and Python modules, this is the first time, as far as I am aware, that a software product allows a developer to easily use the language of SAS, R and Python to implement a program, project or product, all in one development environment. As data science has evolved, R and Python have become more popular, but both lack the data management capabilities of WPS. With WPS, you not only get the language of SAS to process your data, but you also get a multitude of database engines to read, write and access data in the most popular databases.

The beauty of such integration is ease of development. If you have been a developer for as long as I have, you know the time demands of learning a new development environment. Now, with WPS v3.3, a developer can stay within a single development environment for all of their analytical development needs. The developer doesn’t have to learn an R IDE to write, test and execute R code. The same can be said for Python. The Python interface with WPS means having development control as well as execution control of Python programs, and similarly of R programs.

Organizations want to take advantage of R and Python integration because it allows them to create and expand programs and projects. It will quickly become apparent to third-party developers, i.e. those who want to create vertical market applications, that these two additional languages radically increase their tool sets going forward. For many, it also means running production jobs that are self-contained, meaning control and execution are handled by WPS rather than by a bunch of separate tasks or processes that have to be managed individually.

I have been fortunate to have had access to Alpha and Beta builds of WPS for v3.3 and was totally blown away by the improvements and additions that I saw. For example:

  • The data step is faster.
  • Inclusion of the Python programming language.
  • Implementation of Proc IML.
  • Faster data engine access for many databases including multi-threaded loading.
  • More complete graphics output in statistical procedures.
  • PDF support

But what makes this one of the most compelling releases is the integration of R and Python. With Python specifically, you can now execute Machine Learning code from WPS. I’m sure many of you who read this blog are involved in credit scoring, fraud detection, anti-money laundering, market basket analysis, loyalty programs and other real-time analytics. With Python, OpenGL and the CUDA libraries, you can now perform incredibly high-speed processing of data on your desktop or server GPU.

When looking at the breadth of the WPS offering, it’s amazing how much is included for the data scientist who needs to work in multiple languages supporting data analytics. With all the database engines (including Hadoop), WPS Graphics, IML, R, Python, WPS Statistics, and WPS Time Series, I almost faint thinking what the cost would be for something similar from our competitor SAS Institute.

The other aspect of WPS v3.3 that is so enticing is the licensing. There are two components to this that deserve discussion. The first is that Data Service Providers (DSPs) can make use of this software to develop and provide statistical and mathematical models for third parties. Let’s face it, many if not most organizations lack the expertise to develop these models themselves.

The second component is the creation of Vertical Market Applications (VMAs) using the WPS software. The very reasonable cost of the software is a driving factor in being able to create and resell your VMA at a price that small, mid-size and large companies can all afford. Using WPS as the basis for your VMA is advantageous because you’re not beholden to some other organization requiring you to pay partnership fees for access to the software and marketing. I think you would be shocked to learn how competitive and perhaps ruthless a software vendor can be when it comes to introducing and pricing a competing VMA.

If you want to learn more about the latest release of WPS v3.3, especially as it pertains to workstations, read the previous blog post and download the latest brochure for v3.3. You can request an evaluation of WPS v3.3 by contacting info@minequest.com or by filling out our evaluation or quote request form.

I hope everyone had a Merry Christmas, and I wish everyone a Happy New Year.

 


WPS v3.3 Now Available

On Thursday, December 15th, WPL introduced WPS v3.3. This new version is available for immediate download. With v3.3, WPS includes a slew of new procedures that will be of great value to those who hold WPS licenses and those who are looking to convert over to WPS from SAS.

New Language Procedures

Matrix Language Support is now available with PROC IML. PROC IML is included as a standard procedure in WPS and is not an additional cost module. A 430-page programming guide in PDF format, included in the installation folder, details how to use the Matrix Programming Language.

Python Support is now included in WPS with PROC PYTHON. PROC PYTHON allows a WPS user to create, edit and invoke Python programs from within WPS. The implementation of PROC PYTHON is very similar to PROC R. PROC PYTHON is included in WPS and is not an additional cost module.
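
As a rough illustration, here is a minimal sketch of the sort of small, self-contained Python program that could be invoked this way. The WPS-side statements are deliberately not shown, since this post does not give them. The function name `summarize_by_group` and the sample records are hypothetical and stand in for data handed over from a WPS dataset.

```python
from statistics import mean


def summarize_by_group(rows, group_key, value_key):
    """Return {group: (count, mean)} for a list of dict records."""
    groups = {}
    for row in rows:
        # Collect the values for each distinct group label.
        groups.setdefault(row[group_key], []).append(row[value_key])
    return {g: (len(v), mean(v)) for g, v in groups.items()}


# Hypothetical sample records standing in for data passed in from WPS.
sample = [
    {"region": "East", "sales": 100.0},
    {"region": "East", "sales": 140.0},
    {"region": "West", "sales": 90.0},
]

if __name__ == "__main__":
    summary = summarize_by_group(sample, "region", "sales")
    for region, (n, avg) in sorted(summary.items()):
        print(f"{region}: n={n}, mean={avg:.1f}")
```

The sketch uses only the Python standard library, so it should run under whichever Python installation is configured on the workstation.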

ODS Support

WPS now includes output to PDF as well as HTML and Text output destinations. Note that PDF support is available on all platforms except z/OS at this time.

New Statistical Procedures

PROC ACECLUS – Provides two methods for approximating the within-cluster covariance structure for a clustering model under the assumption of equal multivariate Gaussian distributed clusters.

PROC CANCORR – Identifies and measures the associations between two sets of variables.

PROC GENMOD – Fits generalized linear models.

PROC LIFEREG – Fits parametric, accelerated failure time models in the presence of left-, right- and interval censored data.

PROC LIFETEST – Estimates non-parametric survival functions in the presence of censored data using Kaplan-Meier or actuarial methods.

PROC LOESS – Fits non-parametric regression surfaces to multi-dimensional input data. The smoothness of the non-parametric model can be controlled. Outliers in the input data are detected.

PROC MI – Imputes missing values in an input dataset.

PROC MIXED – Fits a mixed linear model to input data.

PROC MODECLUS – Produces various cluster output statistics.

PROC PHREG – Fits the Cox proportional hazards model to survival data.

PROC PROBIT – Fits binary or ordinal response regression models, useful for dose-response type analysis. Various types of model are supported by the procedure. Parameter estimates are generated through the use of maximum likelihood estimation. Model fit statistics enable the quality of the generated model to be assessed.

PROC VARCOMP – Fits generalized linear models with random effects, where the associated covariance matrix is assumed to be diagonal.

Note that WPS Statistics is included in the cost of a WPS license and is not a module that needs to be licensed separately at an additional cost.
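
For readers unfamiliar with the Kaplan-Meier method mentioned under PROC LIFETEST above, the following is a minimal sketch of the product-limit estimator in plain Python. It is an illustration of the general technique only, not WPS's implementation, and the function name `kaplan_meier` and the toy data are hypothetical. At each distinct event time, the running survival estimate is multiplied by (1 - deaths / number at risk); censored subjects leave the risk set without reducing the estimate.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time where an event occurred.

    times  -- observed durations
    events -- 1 if the event occurred at that time, 0 if right-censored
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Gather every subject that shares this time point.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths > 0:
            # Product-limit step: scale by the fraction surviving time t.
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        # Both events and censored subjects leave the risk set.
        at_risk -= removed
    return curve


if __name__ == "__main__":
    # Toy data: five subjects, two of them right-censored.
    for t, s in kaplan_meier([1, 2, 2, 3, 4], [1, 1, 0, 1, 0]):
        print(f"t={t}  S(t)={s:.2f}")
```

A subject censored at the same time as an event is still counted as at risk for that event, which is the usual convention.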

New Graphics Procedure

PROC GBARLINE – The GBARLINE procedure has been added to WPS. This procedure allows you to generate bar charts with plot data overlaid on the bars.

New Data Engine

XLSX Engine – This is a cross-platform engine that provides read and write access to files in Microsoft Excel format. You can process Excel data on any platform you choose and are no longer limited to Windows platforms. The XLSX engine is included in WPS and is not an additional cost module.

Data Engine Enhancements

NETEZZAM – Is a replacement for the NETEZZA engine. NETEZZAM provides multi-threaded operation using a new architecture that enables significant performance increases. The NETEZZAM engine is included in WPS and is not an additional cost module.

ORACLEM – Is a replacement for the ORACLE engine of prior releases. ORACLEM is also multi-threaded, bringing performance increases. The ORACLEM engine is included in WPS and is not an additional cost module.

Both of the above engines can bulk-load data.

There are a number of additional language features and workbench features that are worth investigating as well. WPS v3.3 is a major release in which the functionality, language and procedures have all been augmented.

For a list of all the WPS Procedures and Database Engines that are currently supported in v3.3, you can download a two-page brochure from MineQuest. This brochure lists the database engines that are supported on the Linux, OS X and Windows platforms as well as language support and PROC Support.
