HPLC Gradient Peaks - Fitting Unwound Data (Tutorial)

PeakLab v1 Documentation Contents AIST Software Home AIST Software Support

Unwinding the Gradient and Fitting a Once-Generalized Model with or without an IRF

Once you have determined the IRF model and parameters for a gradient, you can use this system identification of the gradient to "unwind' the gradient as if an isocratic elution had occurred for all peaks. In this case, there is a preprocessing step of unwinding the gradient and then a subsequent fitting of the peaks with any one of the conventional isocratic models, including those with IRFs to manage any portion of the IRF which survived the gradient.

The process of using convolution to 'unwind' the gradient for subsequent conventional isocratic fitting is also discussed in the HPLC Gradient Peaks topic.

Use the File menu's Open item and select the file HPLCGradientStd02.pfd from the program's installed default data directory (\PeakLab\Data).

The file contains eight data sets, each a baseline-corrected data set from a single standard which elutes during the gradient. This data was used in the first gradient peak tutorial, where peaks were fitted to their compressed shapes, and in the second gradient peak tutorial, where the gradient was fitted and deconvolved to internally produce an estimate of a constant mobile phase peak. In this third gradient peak tutorial we will actually generate and fit these gradient deconvolved shapes.

Unwinding the Gradient with a Convolution Model

In the second gradient peak tutorial, we estimated a half-Gaussian IRF to unwind the gradient as having an SD of .0203. We will use this value to create a data level consisting of unwound data. We also found that one-step deconvolution models, with no IRF, produced an average unaccounted variance of 5.4 ppm. We want to see stronger fits in this two-step process.

We will now use the IRF Deconvolution procedure in reverse. We will employ an IRF as a convolution model to undo the compression that occurs within the gradient. We will use the estimation of the system gradient model from the prior fits to unwind the gradient's compression of the peak shapes.

Click on the IRF Deconvolution button and ensure the following settings are selected:

Response Fn
<g> Gaussian
  Convolve Right (we want convolution, not deconvolution)
IRF Parameters
0.0203 (the value from the GenHVL<*g*> deconvolution integral fit
Genetic Alg Optimization
  These settings will not be used
Fourier Filter
  None (no filtering is generally needed for convolution)
Post-Processing
  Zero Baseline Points - unchecked (baseline will be actual subtracted values)
  Zero Negative Points - unchecked (negative baseline points are permitted)

Note the loss of resolution between the blue gradient peaks, and the yellow peaks after the gradient has been unwound.

Click OK to close the procedure and OK to accept the default name and data set titles for the new data level.

Fitting the Unwound Gradient Peaks to the GenHVL<ge> Model

If we successfully unwound the gradient compression in the convolution, we should be able to treat the peaks as having occurred in an isocratic elution.

Click the Local Maxima Peaks button in the main window to open the peak placement screen. We will use the same settings as in the first gradient fitting tutorial where an isocratic peak was fitted:

Peak Detection
Set Sm n(1) to 5
Peak Type
Select Chromatography in the first dropdown
  Select GenHVL <ge> as the model in the second dropdown
Scan
Set the Amp % threshold to 1.5 %
  Leave Use Baseline Segments unchecked
  Be sure Use IRF,ZDD is checked
Vary
  Leave width a2 and shape a3 checked, all other unchecked

Click the IRF button, and if needed set the defaults for the <ge> IRF as 0.004 for the g width (sd), 0.016 for the e width (tau), and 0.625 for the g area fraction. Check the Lock for the g area fraction. This is what we previously did with the isocratic peak to ensure significance of the fitted parameters. We want to use proportionately lower starting estimates than the defaults to account the portion of the IRF that disappears in the gradient and is not unwound. Click OK.

Click Peak Fit to open the fit strategy dialog. Select the Fit with Reduced Data Prefit, 2 Pass, Lock Shared Parameters on Pass 1. Leave the Fit using Sequential Constraints box checked. Click OK. Once again, if you are fitting any peak model bearing an <irf>, you will almost always use this 2-pass option where the main peak parameters are first estimated while the starting estimates for the IRF and ZDD parameters are held constant.

Click Review Fit. You will see some exceptional fits. Four of the eight have fit errors less than 1 ppm unaccounted variances, an r² coefficient of determination of .999999 or higher.

Be sure As Fitted is selected in the first dropdown. This will be the unwound data and for most color schemes, a white curve.

Select the IRF Deconv. option in the second dropdown. This plots, in the first aux component color (red by default for most color schemes), the fitted peak with the IRF removed, the GenHVL, the peak you would see if there existed no system/instrumental distortion. The difference between the red and the white curves is the instrumental distortion recovered in the unwinding of the gradient.

Change the third dropdown to Partial Deconv. The second auxiliary component color is usually green by default. For this model, the green peak (the partial deconvolution) is the pure HVL, the theoretical peak after the ZDD's third moment non-idealities are mathematically removed. The red curve remains the peak absent the system/instrumental effects, the GenHVL. The difference between the red and green curves is that which occurs from fitting an asymmetry or additional third moment parameter in the ZDD, the zero distortion density.

Change the fourth dropdown to Full Deconv. The third auxiliary component color is usually blue by default. For this GenHVL<ge> model, the blue peak (the full deconvolution) is the pure Gaussian, the peak that would exist at infinite dilution. Despite the low concentrations, the a₃ intrinsic chromatographic distortion changes the peak shape from the infinite dilution blue peak, to the fronted HVL peak in green.

Click the Hide Y2 plot in the graph's toolbar.

Double click the third graph (the one with the best goodness of fit) and use the mouse to zoom in the peak.

For reference, this is the isocratic peak fitted in the first of the gradient tutorials.

Although the different deconvolutions are more pronounced in fit of the the isocratic peak (a significant part of the IRF is destroyed in the gradient), those of the unwound gradient follow the same pattern. The a₃ fitted chromatographic distortion remains fronted (a₃<0).

Double click the graph to restore all plots. Click on the Numeric button to open the numeric summary and under the Options menu, choose Select Only Fitted Parameters and then check Measured Values, Deconvolved Moments, and Average Multiple Fits. Scroll to the bottom of the report to see the averages.

We'll first compare the separate isocratic fit in tutorial 1 alongside the averages from these unwound gradient fits to see how much of the asymmetry in the ZDD, and how much of the IRF widths, are present in the unwound or decompressed peaks.

Peak Type a0 a1 a2 a3 a4 a5 a6 a7

Isocratic GenHVL<ge> 6.13746193 2.43158324 0.02205342 -0.0001284 0.11295617 0.01099098 0.01611737 0.62500000

Unwound GenHVL<ge> 6.90040719 8.62110363 0.02097263 -2.879e-5 0.09334915 0.00520456 0.00497757 0.62500000

The a₆ exponential width recovered from the unwound gradient peaks is only about 30% of the isocratic peak's value, very close to our expectation in terms of that portion of the IRF surviving the gradient we estimated by the genetic algorithm optimizations that were made in the first tutorial. For the a₅ half-Gaussian component of the 2-component IRF, we see about 50%, and for the a₄ ZDD asymmetry, we see about 83% of the isocratic value. Because the values are smaller, closer to zero, there are significance issues with the 2-component IRF in many of the fits.

Average for 8 Fits

Fitted Parameters

r² Coef Det DF Adj r² Fit Std Err F-value ppm uVar

0.99999770 0.99999751 0.04037905 18,424,491 2.30209573

Peak Type a0 a1 a2 a3 a4 a5 a6 a7

1 GenHVL<ge> 6.90040719 8.62110363 0.02097263 -2.879e-5 0.09334915 0.00520456 0.00497757 0.62500000

Measured Values

Peak Type Amplitude Center FWHM Asym50 FW Base Asym10

1 GenHVL<ge> 127.569310 8.62662338 0.05154018 1.04872096 0.10302937 1.13018965

Peak Type Area % Area Mean StdDev Skewness Kurtosis

1 GenHVL<ge> 6.90027137 100.000000 8.62844834 0.02197493 0.21018122 3.10351136

Deconvolved Moments

Peak Type Area Mean StdDev Skewness Kurtosis Amplitude Center

1 GenHVL 6.90028425 8.62444886 0.00045822 0.20042433 3.06996218 131.172976 8.62282512

Peak Type Area Mean StdDev Skewness Kurtosis Amplitude Center

1 HVL 6.90039888 8.62440603 0.00044199 -0.0781386 3.00154004 133.023648 8.62577527

Peak Type Area Mean StdDev Skewness Kurtosis

1 Gauss 6.90040235 8.62011954 0.00044018 -7.762e-5 2.99970243

We see what we expect, the kurtosis no longer showing the gradient compression observed in the second gradient tutorial.

Click OK to close the Review, OK to accept a saved GenHVL<ge> fit in the data file, and OK to acknowledge the revision in the file. You will be back in the placement screen.

Fitting the Unwound Gradient Peaks to the Gen2HVL<e> Model

Here we will make the assumption that the remaining accuracy that can be realized in fitting must come from accepting that certain peaks were unwound too strongly and others too weakly in using a common convolution for the unwinding of all of the peaks. We will thus fit the Gen2HVL<e> model to allow the power of the decay to also be fitted, and we will also fit only the simple <e> exponential since the half-Gaussian component of the <ge> IRF cannot be consistently fitted to significance. We want the goodness of fit further improve. We also want the power of decay parameter to be a crosscheck on the unwinding, and we would like to see all parameters significant in all of the fits. We will also lock the exponential value to ensure that there is no interaction with the power of decay parameter.

Select Gen2HLC <e> as the model in the second dropdown.

Click IRF and set the <e> e width (tau) to 0.005 and check the Lock box. Click OK.

Click Peak Fit to open the fit strategy dialog. Be sure the Fit with Reduced Data Prefit, 2 Pass, Lock Shared Parameters on Pass 1 and Fit using Sequential Constraints boxes are checked. Click OK. Click Review Fit when the fits are complete.

If needed, click on the Numeric button to open the numeric summary, and once again scroll to the bottom of the report to see the averages.

Fitted Parameters

r² Coef Det DF Adj r² Fit Std Err F-value ppm uVar

0.99999892 0.99999885 0.03617719 32,653,143 1.07749161

Peak Type a0 a1 a2 a3 a4 a5 a6

1 Gen2HVL<e> 6.89838721 8.62079134 0.02098648 -2.581e-5 2.00640831 0.09061789 0.00500000

Measured Values

Peak Type Amplitude Center FWHM Asym50 FW Base Asym10

1 Gen2HVL<e> 127.554469 8.62661407 0.05157938 1.05009849 0.10295706 1.12993439

Peak Type Area % Area Mean StdDev Skewness Kurtosis

1 Gen2HVL<e> 6.89820516 100.000000 8.62842884 0.02193464 0.20550638 3.06967023

Deconvolved Moments

Peak Type Area Mean StdDev Skewness Kurtosis Amplitude Center

1 Gen2HVL 6.89831839 8.62378304 0.00045505 0.19903353 3.06480468 131.146189 8.62215840

Peak Type Area Mean StdDev Skewness Kurtosis Amplitude Center

1 HVL 6.89838134 8.62375689 0.00044222 -0.0705598 3.00119686 133.010735 8.62498612

Peak Type Area Mean StdDev Skewness Kurtosis

1 Gauss 6.89838353 8.61983841 0.00044085 -6.149e-5 2.99976007

The eight fits now average just above 1 ppm unaccounted variance. Note that no parameters are grayed; every parameter is significant. Further, the a₄ power of decay averages to 2.006, a confirmation of the successful unwinding of the gradient. The a₄ varied from 1.981 to 2.037 suggesting the peaks did experience modest differences in the unwinding of the gradient, which in turn implies the possibility of differences in the compression in the original gradient peaks.

Click OK to close the Review, OK to accept a saved Gen2HVL<e> fit in the data file, and OK to acknowledge the revision in the file. You will be back in the placement screen.

In Perspective Once Again

In the third IC tutorial, we discussed the expectations in goodness of fit in prior versions of the peak fitting technology. If we separate the three fronted cation peaks in the data used in these IC tutorials and fit them with the same diligence we gave to the gradient peaks in this tutorial, we will see the following ppm unaccounted variance averages for the sixteen different conditions of concentration and additive level:

Peak	HVL	GenHVL	HVL<ge>	GenHVL<ge>
Li+	4779	4660	225.25	3.99
Na+	1657	1532	36.42	2.16
NH₄+	966	830	17.91	1.82

The HVL column represents the best fits attainable in the prior v4 of this peak-fitting technology. The GenHVL model includes the ZDD asymmetry adjustment but not the IRF adjustment. The HVL<ge> includes the IRF adjustment, but not the ZDD. The GenHVL<ge> includes both the IRF and ZDD adjustments. As we have stressed in the first six tutorials, the IRF and the higher moment(s) ZDD are exceptionally important in realizing close to zero error fits.

There was a point where sub-100 ppm unaccounted variance fits (r²=0.9999) were deemed impossible using all known theoretical and empirical models. Fitting the IRF and ZDD components in a generalized HVL or NLC model have made that level of fitting routine in PeakLab. As such, our objectives with fitting have changed. We now want to estimate peaks to such a high degree of accuracy that any diminished separation performance will not only be quantified, but where it can be easily observed to change across time.

In seeing these isocratic fits of these high S/N peaks fitted to 2-4 ppm, and with our last set of unwound gradient peaks fitting to 1 ppm, we can note that gradient peaks are capable of seeing equivalent goodness of fits.

Comparing the Three Approaches

Model	a0	a1	a2	a3	a4	a5	a6	a7	F-stat	ppm uVar
(1) Gen2HVL	6.89712	8.61235	0.01836	-6.535E-06	2.13182	0.00782			1,259,491	16.66
*(2) GenHVL<g>*	6.89667	8.62628	0.02142	-2.809E-05	0.09236	0.02032			11,487,966	5.38
*(2) GenHVL<k>*	6.89733	8.62310	0.02081	-3.058E-05	0.09493	0.02122	0.71312		5,759,968	5.70
(3) GenHVL<ge>	6.90041	8.62110	0.02097	-2.879E-05	0.09335	0.00520	0.00498	0.62500	18,424,491	2.30
(3) Gen2HVL<e>	6.89839	8.62079	0.02099	-2.581E-05	2.00641	0.09062	0.00500		32,653,143	1.08

While there are the differences we covered in the second gradient tutorial with respect to (1) fitting the eluted compressed gradient shape and (2) fitting a deconvolved model that internally unwinds the gradient, here we see no difference in a₀-a₃, the principal HVL parameters, nor in a₄, the skewness adjustment to the zero distortion density, between (2) fitting a deconvolved model that internally unwinds the gradient and (3) fitting the convolved (unwound) data.

Direct Closed Form Fit - Fitting the Compressed Shapes

In the direct fits of the Gen2HVL and GenHVL[Q] to the gradient shapes (the first gradient tutorial), we see compressed a₂ widths, much smaller a₃ distortions, a much higher a₄ power of decay, and an a₅ ZDD asymmetry that is also a great deal smaller. The gradient shape is fitted, and the a₄ power of decay parameter directly maps the strength of the gradient. The fits are slightly weaker, though still very effective. An isocratic model is being fitted to a gradient shape, but the variable power of decay, which fits the reduced kurtosis from the compression, the thinning of the tails, allows for these models to be effective with gradient shapes. With respect to the HVL estimated parameters, the a₁ location will not match exactly, the a₂ deconvolved Gaussian width will be smaller, and the a₃ chromatographic distortion will be far smaller, when contrasted with the isocratic deconvolved values for the peaks. The advantages of this approach are the direct measurement of the gradient strength and the ease of fitting very fast closed form models.

If you have no need to know what the peak would actually look like in an isocratic elution, and you need little more than the areas of some count of overlapping or hidden peaks, or if you want the plate count to include the gradient's contribution to the separation, you should fit the compressed shapes directly. You give up a small measure of accuracy (very small when seen in context), the fitted HVL peak widths will reflect the gradient compression, and the theoretical plates, at least by the method of moments, should reflect the separation that actually took place. You will still get a measure of the gradient strength in the fourth moment term. It is indirect, a measure of the fourth-moment tailing, but it was, at least for us, an effective estimate of the strength of the gradient.

Fitting the Gradient with a Deconvolution Model

In the GenHVL<*g*> and Gen2HVL<*g*> deconvolution models, we fit an intrinsic unwinding of the gradient. The a₀-a₃ HVL parameter values represent a best-case scenario where each peak is estimated as unwound, even if the data itself does not see this unwinding in an extrinsic step. The GenHVL<*g*> forces estimates which produce a full unwinding, and the half-Gaussian width measures the SD used for that unwinding. The Gen2HVL<*g*> additionally fits a power of decay which confirms the unwinding. The advantages of this approach are isocratic peak parameter estimates, and better goodness of fits since the gradient is modeled. The disadvantages include the need for baseline-resolved peaks with a strong baseline on each side of each peak, the need for a strong zero baseline on both sides of the data for the Fourier continuity, the need for a well-designed Fourier noise filter, and the fitting which must occur in the Fourier domain will be much slower, and prone to local minima very near the global minimum as a consequence of this noise filtration.

In this approach the estimate of the power of the gradient is in the unwind parameter(s), not in a parameter that describes the thinness of the tails of the gradient shape.

Because of these drawbacks, an approach where the gradient is fitted only on single peak standards used to test system stability is recommended. One can fit a deconvolution model to production data with many peaks, and we will do so at the end of this last gradient tutorial, but it is not, at least at present, a viable analytic approach for production data, since the requirements mentioned above will often be lacking.

Convolution Unwind and Subsequent Isocratic Fit

Although we need to know the IRF parameter(s) for the gradient, the unwinding occurs in a forgiving convolution. In this tutorial, we fit the GenHVL<ge> and Gen2HVL<e> models, the latter offering an incredible 1 ppm average unaccounted variance fitting accuracy. The GenHVL<*g*> of the one-step deconvolution fitting, and the Gen2HVL<e> of the unwound convolved data produce essentially the same HVL parameter estimates for an unwound isocratic peak. The advantage of the two step procedure is clearly the exceptional goodness of fits. The disadvantages are the need to predetermine the IRF for the gradient and the loss of resolution arising from the convolution of the data in the unwinding, a broadening that may make certain fits more difficult if a local maxima peak is lost or becomes less well-defined. There is also the issue of each peak seeing exactly the same unwinding in this preprocessing step, where the first two approaches process the gradient on a peak by peak basis.

If you compare the goodness of fit values from the gradient peak fits with the above cation fit reference values, they are comparable only when you make the extra effort to fit the gradient or to unwind the estimated gradient prior to fitting. Such may well be worthwhile, especially with critical certification of systems. To have everything mapped to within 1ppm unaccounted variance does mean you will catch any anomaly swiftly and you are effectively measuring everything that can be modeled. You are also getting an estimate of the true peak and its key parameters with the impact of the gradient removed. You see very accurate deconvolved Gaussian widths and chromatographic shapes from the a₂ and a₃ values, and when fitting a fourth moment Gen2 model, you also get a confirmation of the unwinding of the gradient, and such is available on a peak-by-peak basis if you have strong baseline resolved peaks and if this a₄ power of decay parameter is not shared across the peaks.

Importing Gradient Peak Production Data

Use the File menu's Open item and select the file HPLCGradientPeaks.pfd from the program's installed default data directory (\PeakLab\Data). The file contains one gradient peak data set which has already been preprocessed for baseline.

Click on the IRF Deconvolution button and ensure the following settings are selected:

Response Fn
<g> Gaussian
  Convolve Right
IRF Parameters
0.0203
Genetic Alg Optimization
  These settings will not be used
Fourier Filter
  None (a frequency domain filter is not normally required for convolution)
Post-Processing
  Zero Baseline Points - unchecked (baseline will be actual deconvolved values)
  Zero Negative Points - unchecked (negative baseline points are permitted)

The blue curve is the compressed data as eluted, the yellow curve is the convolution of the data and estimated gradient, the 'unwound' data. Note that the last peak loses its local maximum (there is no longer a defined apex) as it merges into the larger adjacent peak.

Click OK. Click OK to accept the default titles. There is now a second data level in the file with the name DC(<g>,CR,0.0203,None,0). The default data levels names reflect the global settings in the procedure. This level originated with the IRF Deconvolution procedure (DC), with the half-Gaussian IRF (<g>), a right-side convolution (CR), the <g> parameter (0.0203), and with no filter (None,0). The actual settings, including all custom settings, will appear in the automatically generated second graph title. In this case, the graph title is identical:

Fitting the Gradient Peak Shapes Directly

Select Imported Data from the Data Levels dropdown.

Click the Local Maxima Peaks button in the main window to open the peak placement screen. We will use the same settings as in the first gradient fitting tutorial for the isocratic peak:

Peak Detection
Set Sm n(1) to 5
Peak Type
Select Chromatography in the first dropdown
  Select Gen2HLC as the model in the second dropdown
Scan
Set the Amp % threshold to 0.5 % (we need to catch the tiny peak)
  Leave Use Baseline Segments unchecked
  Be sure Use IRF,ZDD is checked
Vary
  Leave width a2 and shape a3 checked
  Check a4 (we want the power of decay to vary on a peak by peak basis to estimate the gradient present at each peak)

We do not have to worry about the IRF since we are not fitting one. You should see 11 peaks automatically placed, all local maxima peaks (an apex is seen in the data).

Click Peak Fit to open the fit strategy dialog. Select the Fit with Reduced Data Prefit, Cycle Peaks, 2 Pass, Lock Shared Parameters on Pass 1. Leave the Fit using Sequential Constraints box checked. Click OK. The Cycle Peaks strategy performs certain fits on a peak by peak basis to ensure the fitting accuracy of small peaks. You will almost always want to use this strategy when multiple peaks are present. Despite the complexity of the Gen2HVL closed form model, the fitting should require less than a second, all cores of your CPU utilized for the fitting.

Click Review Fit. We have a lovely 19.97 ppm unaccounted variance.

If needed, click Show or Hide Legend Titles in the graph toolbar to shift the peak formulas to the left of the graph's area.

Click on the Numeric button to open the numeric summary and under the Options menu, choose Select Only Fitted Parameters.

Fitted Parameters

r² Coef Det DF Adj r² Fit Std Err F-value ppm uVar

0.99998003 0.99997875 0.02249492 797,441 19.9725764

Peak Type a0 a1 a2 a3 a4 a5

1 Gen2HVL 0.45909985 7.36270241 0.01889909 -7.747e-06 2.13140190 0.00092235

2 Gen2HVL 0.01579321 7.51083120 0.01809079 -9.677e-05 1.68544369 0.00092235

3 Gen2HVL 0.42246880 8.10297608 0.01816043 -6.439e-06 2.11953543 0.00092235

4 Gen2HVL 1.51479133 8.22803626 0.01767838 -7.601e-06 2.11094630 0.00092235

5 Gen2HVL 1.01874016 8.32467112 0.01756996 -9.913e-06 2.09490368 0.00092235

6 Gen2HVL 0.25669538 8.81856822 0.01986309 2.3301e-06 2.10354569 0.00092235

7 Gen2HVL 1.08821191 9.15973955 0.01788931 -4.334e-06 2.11045430 0.00092235

8 Gen2HVL 0.40892996 9.32687815 0.01852180 1.768e-06 2.14282935 0.00092235

9 Gen2HVL 0.34371131 9.57662518 0.01890399 1.4983e-06 2.17368641 0.00092235

10 Gen2HVL 0.21967402 9.73581480 0.01968705 5.6924e-06 2.09690879 0.00092235

11 Gen2HVL 0.04173413 9.83391853 0.02322546 0.00010687 1.62394283 0.00092235

The a₅ fails significance testing but again this tests for differences from zero. In this case, a tiny positive a₅ simply means the fit was unable to find a statistically significant third moment variation from the the HVL's symmetric Gaussian zero-distortion density assumption (a₅=0).

The a₄ confirms the thinning of the tails on all of the peaks of significant area. The two very small peaks either lacked sufficient information, or the baseline correction introduced a sufficient error, so that the gradient compression could not be fitted. Also, when peaks are overlapped, as occurs here, this fourth moment may not have the benefit of baseline on either side. As you will note, however, except for the smaller peaks, all fit close to the 2.13 of the averaged standards. We know, in fitting the actual gradient shape, we will have a lower a₂and lower magnitude a₃ chromatographic distortions. Here two of these are indistinguishable from zero (neither tailed nor fronted) and three fit to tiny positive (tailed values).

Click OK to close the Review, OK to accept a saved Gen2HVL fit in the data file, OK to acknowledge the revision in the file, OK to return to the placement screen, and OK one last time to return to the main window.

Fitting the Unwound Gradient Peaks

We know that most of the isocratic IRF is effectively offset in the gradient, and that all that remains (that we can identify) in the unwound gradient peaks is a small amount of residual exponential distortion. For this ultra HPLC reverse phase chromatography, we found that an isocratic peak only had about 1/3 of the exponential IRF width in comparison with high S/N IC peaks (about 0.016 as contrasted with 0.040-0.045). Further, we found only about 0.005-0.006 of this exponential IRF survives in a gradient peak.

In our experience, fitting this small of an IRF is probably not going to add a significant benefit, and it will generally be simpler, and far faster, to fit a closed form model, typically the GenHVL, or the Gen2HVL (if you wish to fit a fourth moment term to confirm the successful unwinding of the gradient). You will wish to fit peaks with and without this <e> IRF to be certain, but in our experience a simple closed form model suffices unless you are seeking to estimate this small residual exponential IRF.

Select the uppermost 'DC' data from the Data Levels dropdown.

Click the Hidden Peaks - Second Derivative button in the main window to open the peak placement screen. We know the last of the peaks is now hidden. We will see if the D2 algorithm can automatically place this peak for us.

Baseline
  No Baseline
Peak Detection
Set Sm n(1) to 5 (this small level of D2 smoothing is possible because the S/N is very good on this data)
Peak Type
Select Chromatography in the first dropdown
  Select GenHVL as the model in the second dropdown
Scan
Set the Amp % threshold to 0.5 % (to again catch the tiny peak)
  Leave the D2 % threshold at 0.00 (all smoothed D2 minima drop below 0)
  Leave Use Baseline Segments unchecked
  Be sure Use IRF,ZDD is checked
Vary
  Leave width a2 and shape a3 checked (the widths and shapes must vary for each peak)
  Leave a4 unchecked (the third moment deviation from non-ideality will be shared by all peaks)

You should see 11 peaks, all automatically placed.

We will not be fitting an IRF, and unless you altered the ZDD default starting estimates for the GenHVL, there is no need to open the ZDD dialog.

Click Peak Fit to open the fit strategy dialog. Be sure the Fit with Reduced Data Prefit, Cycle Peaks, 2 Pass, Lock Shared Parameters on Pass 1 and Fit using Sequential Constraints boxes are checked. Click OK.

Click Review Fit when the fit is finished, only a second or two.

The 10.87 ppm error is good, but we will also fit the Gen2HVL model to see if there was too strong or too weak of an unwinding. The Gen2HVL model's a₄ power of decay parameter should fit to very close to 2.0 if the unwinding was accurate.

Click Cancel to immediately return to the placement screen. Change the model to the Gen2HVL. Click Peak Fit, and then click OK. Click Review Fit when the fitting is finished.

The 7.37 ppm error suggests we were fairly close to an optimal unwinding.

If needed, click on the Numeric button to open the numeric summary and under the Options menu, choose Select Only Fitted Parameters.

Fitted Parameters

r² Coef Det DF Adj r² Fit Std Err F-value ppm uVar

0.99999263 0.99999224 0.01215434 2,669,738 7.37478257

Peak Type a0 a1 a2 a3 a4 a5

1 Gen2HVL 0.45992868 7.37517987 0.02193946 -3.838e-05 1.98065954 0.09357987

2 Gen2HVL 0.01575713 7.52961772 0.02225342 -8.448e-05 1.98065954 0.09357987

3 Gen2HVL 0.42323125 8.11622542 0.02144310 -2.686e-05 1.98065954 0.09357987

4 Gen2HVL 1.51676271 8.24150130 0.02104409 -2.531e-05 1.98065954 0.09357987

5 Gen2HVL 1.01627415 8.33837014 0.02087343 -2.579e-05 1.98065954 0.09357987

6 Gen2HVL 0.25783518 8.83123496 0.02305417 -2.37e-05 1.98065954 0.09357987

7 Gen2HVL 1.08963446 9.17290683 0.02123367 -2.232e-05 1.98065954 0.09357987

8 Gen2HVL 0.40942620 9.33910702 0.02169774 -2.281e-05 1.98065954 0.09357987

9 Gen2HVL 0.34422844 9.58850247 0.02192766 -2.546e-05 1.98065954 0.09357987

10 Gen2HVL 0.22281563 9.74900684 0.02323154 -1.534e-05 1.98065954 0.09357987

11 Gen2HVL 0.03861160 9.84264943 0.02692057 3.764e-05 1.98065954 0.09357987

The a₄ power of decay in the Gen2HVL fit to a value of 1.98, very close to our expectation. Most isocratic peaks, in our experience, will fit to 1.97-1.99, just below below 2.0. This suggests the <*g*> convolution width of 0.0203 taken from the standards produces an adequate unwinding, neither too strong nor too weak.

Click Cancel to immediately return to the placement screen and OK to return to the main window.

Fitting the Gradient in Production Data

We assume your standards will properly give you the gradient estimate, but if such is not the case, and you need to extract the gradient unwind parameter(s) directly from messy production data, we can recommend three approaches for getting the gradient estimate directly from a production sample. We will finish this gradient peak set of tutorials with these three methods:

Isolate One or More Baseline Resolved Peaks

If you have baseline resolved peaks, you can use a feature of the baseline subtraction to remove all adjacent peaks and force the zero extended baseline the deconvolution modeling requires. This is the simplest of the three approaches, but a measure of intelligent preprocessing is needed.

Create a Series of Different Unwindings and Determine the Optimum from Goodness of Fit

In this approach, you must generate replicate data sets and in each create a different strength of unwinding. By fitting a model whose power of decay is 2, such as the GenHVL, the goodness of fit will be at a maximum when the unwinding is closest to complete, neither overdone or underdone. This is an approach that works irrespective of baseline resolved peaks. It can appear somewhat cumbersome, but it is effective and it is difficult to get an incorrect estimate, and with some experience, it actually requires very little time.

Fit a Deconvolution Model to the Production Data as Is, Painstakingly Jumping the Fourier-Introduced Local Minima

In this approach, you simply nurse the deconvolution model, applied to every peak in the production data, through the Fourier filter's oscillations in n-dimensional parameter space. No preprocessing is needed, only a good measure of patience and persistence.

Isolate One or More Baseline Resolved Peaks

In this approach we must identify baseline resolved peaks in the data and zero everything else, including all peaks which we wish to remove from the estimation process.

Selected Imported Data from the Data Levels dropdown in the main window. Click Copy, answer Yes, and OK to accept the default new data level titles. We now have a Copied Data(*) data level. The (*) means it will be replaced by a Transform, Baseline, or IRF Deconvolution level if you immediately engage one of these procedures.

Click Subtract Baseline.

We will be manually assigning baseline and zeroing those points, so the settings for automated baseline detection will not matter. We are still fitting extracted peaks which will be baseline corrected (even though this data has already been baseline corrected), so the baseline modeling will matter. Please be sure the Model dropdown is set to Non-Parm Linear with the default NP n of 10, the window size of the points used for the non-parametric fit. The requirement of consecutive points can be left at 10. It will not matter when manually setting the baseline as we will now do.

Check Zero Baseline Points and Zero Negative Points. Note that Zero Baseline Points and Zero Negative Points are not normally used options for conventional baseline processing.

To help a bit with visualization, click the Zoom-In mode in the graph toolbar. Further zoom-in the already zoomed baseline points as follows:

We agree, there isn't one peak you would even want to isolate. Still, we will employ this first approach for gradient estimation with what we have available.

Click the Manual Sectioning mode. Be sure either the 2nd Deriv Zero or SD Variation algorithm is selected. Manual baseline assignment is only available with these algorithms.

Baseline points are assigned (peak removed) by left clicking the mouse at a point where the baseline is deemed to conclude, and sliding left and releasing the mouse when all of the intended baseline points have turned yellow (instead of gray).

Peak points are assigned (baseline removed) by left clicking the mouse at a point where the peak information is deemed to begin, and sliding right and releasing the mouse when all of the intended peak points have turned gray (instead of yellow).

What we wish to do is make the three peaks at 7.52, 8.82, and 9.58 discrete and the only peaks in the data that emerge from this procedure.

Reproduce as closely as you can the pattern of baseline and peak as above.

Start at the far right, click the mouse and slide to the far left, rendering the entirely of the data as baseline (all yellow). Place the mouse just before the start of the first peak that is grayed above, click and hold the mouse button down while sliding to the right and stop at the valley between the two peaks and release the mouse. For the second of the three peaks grayed above, start just before the peak begins to rise and stop just after the ugly feature in its tail ends. For the third peak grayed above, start just before the rise and again stop at the lowest point in the valley.

This does take some getting used to, as it is counterintuitive to assign major peaks as baseline. In actual practice, it only requires a few seconds.

If you look at the three gray peaks and the white baseline below them, you see that the first and last were not fully baseline-resolved, and the middle peak, which was, has a couple of very tiny components in its tail. We thus illustrate why other approaches to estimating the gradient with production data may have value. Even though the premise of isolating one more baseline-resolved peaks failed, we will proceed with the analysis.

One you have the assignments similar to the above, click OK, and click OK to acknowledge the manual processing.

Exactly as we did in the second tutorial, click the Local Maxima Peaks button.

Be sure the following settings are in place. We will fit the GenHVL<*g*> deconvolution model to get the gradient's half-Gaussian SD for these four peaks.

Peak Detection
Set Sm n(1) to 5
Peak Type
Select Chromatography w/IRF in the first dropdown
  Select GenHVL<*g*> as the model in the second dropdown
Scan
Set the Amp % threshold to 1.5 %
  Leave Use Baseline Segments unchecked
  Be sure Use IRF,ZDD is checked
Vary
  Leave width a2 and shape a3 checked, all other unchecked

If the IRF value for the <g> SD is the .02 from the prior tutorial or the default, the Gaussian IRF deconvolution width will be too high. There will again be these oscillations below zero from the deconvolution.

Click the IRF button and for the <g> IRF, enter 0.01 for the g width (sd) default starting estimate. Click OK.

Click Peak Fit to open the fit strategy dialog. Select the Fit with Reduced Data Prefit, Cycle Peaks, 2 Pass, Lock Shared Parameters on Pass 1. Leave the Fit using Sequential Constraints box checked. Click OK.

Click Review Fit.

The fit is weak, but such was not unexpected.

Fitted Parameters

r² Coef Det DF Adj r² Fit Std Err F-value ppm uVar

0.99991167 0.99991032 0.01265508 799,409 88.3265916

Peak Type a0 a1 a2 a3 a4 a5

1 GenHVL<*g*> 0.45742137 7.37717205 0.02210176 -3.85e-05 0.09891214 0.02156776

2 GenHVL<*g*> 0.25689977 8.83269307 0.02314267 -2.804e-05 0.09891214 0.02156776

3 GenHVL<*g*> 0.34307824 9.59044454 0.02218133 -2.59e-05 0.09891214 0.02156776

The three peaks have a shared a₅ half-Gaussian deconvolution SD of 0.0216, in reasonable agreement with the 0.0203 average from the fits of the standards.

Click OK to close the Review, OK to accept a saved GenHVL<*g*> fit in the data file, OK to acknowledge the revision in the file, OK to return to the placement screen, and OK to return to the main window.

Creating an Optimization

The following steps are a useful exercise if you find you need to create an optimization experiment based on fitting replicated data sets with differences in preprocessing.

We will replicate the baseline-corrected data and generate different <g> convolution unwound data sets. If these sets are fit to models with only a skewness third moment adjustment, such as the GenHVL, where there is no fourth moment power of decay adjustment impacting the thinness or fatness of the tailing, this intrinsic power of 2.0 decay will cause the goodness to fit to an optimum where the gradient unwinding most closely accommodates the model. We want to fit a model whose goodness of fit will be adversely impacted by the lack of an optimal unwinding of the gradient, not a model that capably adjusts for unwinding that is either too weak or too strong.

With Imported Data selected at the main window, select New to create a new PeakLab data file with only this data level. Yes to proceed. Yes to save the prior data if asked to do so. Note that the newly created file is not yet saved.

Right click the graph and select Replicate this Data Set a Specified Number of Times from the popup menu. Enter 10 to create 10 replicates. There are now 11 identical data sets.

Click the IRF Deconvolution procedure button and be sure the following settings are in place:

Double click the first data set and enter a custom unwind <g> convolution width of 0.025. Double click to return to all data sets with this set retaining the custom adjustment. The first data set will show a <g> [0.025] in its graph title. Repeat this custom adjustment for each of the data sets. Double click the second data set, enter 0.024, and double click again to restore all of the graphs. Do the same for the third data set, enter 0.023, and for the fourth 0.022, and so on until the last of the 11 data sets is given the custom unwind width of 0.015.

Click OK, OK to acknowledge the local convolution warnings, and OK to accept the titles. There is now a 'DC' data level with 11 different <g> unwind convolution widths. And once more the tedious work is finished, and the program will do the rest.

Click the Hidden Peaks - Second Derivative button. We still need the hidden peak method for the higher deconvolution widths in the series which wash out the local maximum in the last tiny peak rendering it hidden. Leave everything as set in the prior fitting except change the model to the GenHVL. Again we need a model that enforces a power of decay of 2.0 in order for this optimization to work. You will see 11 peaks automatically placed in each of the 11 data sets.

Right click any one of the graphs in the Review and select the Map Experimental Process Variables for Fitted Data Sets option. Enter <g> for the variable name and the values 0.025, 0.024, etc. for the 11 sets. Click OK.

Click Explore, set <g> as the x variable and Fit Stats for the y variable.

For this production sample, the <g> optimal unwind width for a GenHVL fit is less, about 0.018 instead of the 0.020 we saw with the fitted standards. At this optimum, the ppm is at a minimum (the r² at a maximum), the SE is at a minimum, and the F-statistic at a maximum. It is straightforward to estimate 0.0135 as very close to the optimum <g> unwind width.

This optimization is accurate because we optimized the very process we will use for the unwound fitting. In fact, if we are willing to use the 0.018 value for the unwinding, we are already finished:

Fitted Parameters

r² Coef Det DF Adj r² Fit Std Err F-value ppm uVar

0.99999187 0.99999145 0.01301282 2,476,110 8.13218881

Peak Type a0 a1 a2 a3 a4

1 GenHVL 0.45978603 7.37358446 0.02135035 -3.489e-05 0.07836035

2 GenHVL 0.01572565 7.52809144 0.02171820 -7.995e-05 0.07836035

3 GenHVL 0.42301167 8.11444321 0.02079366 -2.54e-05 0.07836035

4 GenHVL 1.51605097 8.23966666 0.02037827 -2.438e-05 0.07836035

5 GenHVL 1.01675991 8.33650315 0.02022356 -2.507e-05 0.07836035

6 GenHVL 0.25757480 8.82958350 0.02243447 -2.093e-05 0.07836035

7 GenHVL 1.08917613 9.17115589 0.02058068 -2.092e-05 0.07836035

8 GenHVL 0.40939675 9.33760412 0.02109691 -1.963e-05 0.07836035

9 GenHVL 0.34424315 9.58704329 0.02135228 -2.194e-05 0.07836035

10 GenHVL 0.22224651 9.74721598 0.02254189 -1.36e-05 0.07836035

11 GenHVL 0.03908607 9.83991249 0.02638390 3.4451e-05 0.07836035

Here we do see the the unwound peaks fitted to an isocratic model with a higher degree of accuracy than fitting the compressed shapes. Is there a difference between 8 ppm and 20 ppm unaccounted variance? The better question pertains to the difference between fitting the compressed shapes with a fourth moment isocratic model, and unwinding the gradient to produce an isocratic peak which is then fitted to model with only a third moment skewness adjustment.

As we noted prior, the unwound peaks will have a higher a₂ width.

Unwinding the peaks to a deconvolved isocratic shape does mean the a₃ chromatographic shapes should be more accurate. In the first gradient tutorial, we saw almost no correlation between the a₃ and the different moments in these direct fits of the compressed gradient shapes. In the second gradient tutorial, a₃ had a greater impact on moments when the gradient was modeled and some form of isocratic unwinding occurred. If we omit the two tiny peaks, the a₃ values vary here from -3.49e-5 to -1.36e-5, all fronted, and all in a very narrow range. The Gen2HVL fits to the gradient shapes had a mix of +a₃ and -a₃ values varying from -8.7e-5 to +9.2e-5. (fronted to tailed), and we know from the fits to the standards that all should be fronted (-a₃).

Click OK to close the Review, OK to accept a saved GenHVL fit in the data file, OK to acknowledge the revision in the file, OK to return to the placement screen, and OK to return to the main window.

Close this data file with the replicates.

Creating the new Data Layer with the Optimized Unwinding in Original File

We will create a new data level with the <g> convolution unwind width of .018.

Select the file HPLCGradientPeaks.pfd from the bottom of the MRU (most-recently used) list of files in the File menu.

Select the Imported Data in the Data Levels dropdown and click the Copy button. Answer Yes to create the new data level and OK to accept the default titles. Click the IRF Deconvolution button and change the <g> parameter value to 0.0135. Click OK to accept the convolution, and OK once again to accept the data level titles.

Since the peaks retained their local maxima with this 0.018 <g> response function convolution width, though barely, we no longer need to use a hidden peak method.

Click the Local Maxima Peaks button. From the Chromatography family, change the model to the GenHVL. Set the Amp % threshold to 0.5 %. In the Vary section, be sure the a2 and a3 are checked and that a4 is unchecked. You should see the 11 peaks, all automatically placed.

Click Peak Fit to open the fit strategy dialog. With the Fit with Reduced Data Prefit, Cycle Peaks, 2 Pass, Lock Shared Parameters on Pass 1 and Fit using Sequential Constraints boxes checked, click OK.

Click Review Fit when the fit is complete.

The error with the 0.18 fit is 8.13 ppm, and we again contrast this with the 19.96 ppm on the Gen2HVL direct fits of the compressed shapes.

If needed, Click on the Numeric button to open the numeric summary and under the Options menu, choose Select Only Fitted Parameters and then check Chromatographic Analysis.

Fitted Parameters

r² Coef Det DF Adj r² Fit Std Err F-value ppm uVar

0.99999187 0.99999145 0.01301282 2,476,110 8.13218881

Peak Type a0 a1 a2 a3 a4

1 GenHVL 0.45978603 7.37358446 0.02135035 -3.489e-05 0.07836035

2 GenHVL 0.01572565 7.52809143 0.02171820 -7.995e-05 0.07836035