HPLC Gradient Peaks - Fitting the Unwound Data

PeakLab v1 Documentation Contents AIST Software Home AIST Software Support

Approach 3: Unwinding the Gradient and Fitting a Once-Generalized Model with or without an IRF

If the gradient is perceived as a non-linear transform that compresses a peak, one can use the program's convolution/deconvolution engine to seek to unwind or undo the gradient, approximating the peak that would exist if the gradient had not occurred. Unlike the IRF deconvolution, where one seeks to remove a distortion to recover the true peak, here we seek to recover the true peak by adding back that which occurred within the gradient. In other words, we treat the gradient process as a deconvolution which narrows the peak, and we apply a convolution to restore the original width and shape of the peak. We treat the gradient process as a deconvolution which is undone by a subsequent convolution with an IRF which represents its system response function.

You may wish to refer to the tutorial: HPLC Gradient Peaks - Fitting Unwound Data (Tutorial)

In the spirit of allowing the data to reveal the nature of the system IRF altering it, we can take the simple approach of convolving different IRFs to see if a peak shape can be restored which can be more accurately fitted with an isocratic model. We can make the inference that the better an isocratic model is fitted, the closer we are to the IRF or system model that represents the gradient process.

If the gradient process is a purely linear one, we can use a zero order kinetic model to restore the peak shape. If the gradient process is first order, we can use an order 1 kinetic model. We can convolve any order kinetic model we wish. We can also fit a probabilisitic IRF. If we assume the gradient process is akin that of a one-sided Gaussian deconvolution, we can restore the original peak shape by convolving a simple half-Gaussian response function.

In this example, we apply a simple one-parameter Gaussian convolution data to the first data set (in blue above), the one we directly fit with the Gen2NLC model and realized a 12.5 ppm unaccounted variance. If this fit of the convolved data (the yellow curve) is significantly improved, we can assume we have unwound at least a portion of the gradient without introducing more harm than benefit. In fairness, convolution does smooth data, so we will see a modest improvement from this noise reduction, but only so long as the new shape is at least as applicable as the original to the model used in the fitting.

The half-Gaussian convolved data fits to an astonishing 0.28 ppm unaccounted variance, a 50x reduction in the squared error.

Fitted Parameters

r² Coef Det DF Adj r² Fit Std Err F-value ppm uVar

0.99999972 0.99999969 0.03589583 58,284,780 0.28480840

Peak Type a0 a1 a2 a3 a4 a5

1 Gen2NLC 10.9961570 8.33755300 2.4757e-5 -1.98e-5 2.00155019 30.1860459

Equivalent Parameters

Peak Type a0 a1 a2 a3 a4 a5

1 Gen2HVL 10.9961570 8.33755300 0.02031824 -1.98e-5 2.00155019 0.07356203

Parameter Statistics

Peak 1 Gen2NLC

Parameter Value Std Error t-value 95% Conf Lo 95% Conf Hi

Area 10.9961570 0.00072634 15139.1556 10.9947124 10.9976017

Center 8.33755300 4.0391e-5 2.0642e+5 8.33747266 8.33763333

Width 2.4757e-5 4.543e-8 544.959992 2.4667e-5 2.4848e-5

Distortn -1.98e-5 3.7678e-7 -52.540847 -2.055e-5 -1.905e-5

Q-power 2.00155019 0.00052882 3784.91919 2.00049839 2.00260200

NLCasym 30.1860459 0.21616022 139.646626 29.7561119 30.6159799

The a₄ of the Gen2NLC fit is 2.002, essentially a pure Gaussian decay. Unlike the prior fit where the gradient was not unwound, here all of the parameters have strong significance. If you can identify and quantify the gradient's response function, as we did here, you can fit a GenNLC or GenHVL to the convolved data:

Fitted Parameters

r² Coef Det DF Adj r² Fit Std Err F-value ppm uVar

0.99999969 0.99999967 0.03749380 66,778,129 0.31447412

Peak Type a0 a1 a2 a3 a4

1 GenNLC 10.9967995 8.33749378 2.4886e-5 -2.046e-5 30.4568072

Equivalent Parameters

Peak Type a0 a1 a2 a3 a4

1 GenHVL 10.9967995 8.33749378 0.02037084 -2.046e-5 0.07441454

Parameter Statistics

Peak 1 GenNLC

Parameter Value Std Error t-value 95% Conf Lo 95% Conf Hi

Area 10.9967995 0.00072330 15203.5459 10.9953611 10.9982379

Center 8.33749378 3.6714e-5 2.2709e+5 8.33742077 8.33756678

Width 2.4886e-5 1.2767e-8 1949.29874 2.486e-5 2.4911e-5

Distortn -2.046e-5 3.1904e-7 -64.124470 -2.109e-5 -1.982e-5

NLCasym 30.4568072 0.20405363 149.258834 30.0510241 30.8625902

Here we have a higher F-statistic in the simper model. A F-statistic represents a model's capacity to represent the data properly accounting the number of parameters thrown at the fitting problem.

Fitting an GenHVL-based vs GenNLC-based Model for HPLC Gradient Peaks

The GenHVL model fits an a₂ deconvolved Gaussian SD and an a₄ generalized normal asymmetry at infinite dilution. The GenNLC fits an a₂ kinetic width, the time constant of first order Giddings kinetics, and an a₄ asymmetry indexed to both the Gaussian(HVL) at a₄=0 and to the Giddings(NLC) at a₄=0.5.

Since the Equivalent Parameters section of the Numeric Summary offers the parameters for both, you may wish to fit whichever model offers the confidence statistics you are most interested in. For fitting single peaks, there is no difference between the fit of these models. We mention elsewhere the value of selecting a kinetic or probabilistic generalized model based on an asymmetry parameter shared across peaks. Whereas this can make small differences in non-gradient analytic peaks, for gradient peaks the differences are negligible because of the low a₃ chromatographic distortion.

Drawbacks of a Convolution Preprocessing

There is the obvious issue of loss of resolution. In the 11 peak example, all 11 peaks are clear, even if several are not fully baseline resolved.

If we apply the half-Gaussian convolution to undo the gradient and fit the 11-peak data set, there are just 10 local maxima peaks which fit to a higher 87 ppm unaccounted variance. The original tenth and eleventh peaks fit as a single peak of an asymmetric shape.

Fitted Parameters

r² Coef Det DF Adj r² Fit Std Err F-value ppm uVar

0.99991271 0.99990859 0.03684701 248,671 87.2855663

Peak Type a0 a1 a2 a3 a4 a5

1 Gen2HVL 0.34204975 4.49269628 0.02086406 -1.792e-5 1.99181957 0.13034307

2 Gen2HVL 0.01166148 4.61046668 0.02123436 -3.016e-5 1.99181957 0.13034307

3 Gen2HVL 0.31537769 5.04350031 0.02072665 -9.335e-7 1.99181957 0.13034307

4 Gen2HVL 1.13003945 5.13668427 0.02049991 1.936e-6 1.99181957 0.13034307

5 Gen2HVL 0.75112499 5.20832806 0.02024335 -2.882e-6 1.99181957 0.13034307

6 Gen2HVL 0.19236483 5.57414105 0.02173971 -3.214e-6 1.99181957 0.13034307

7 Gen2HVL 0.81058549 5.82764475 0.02048346 -4.459e-6 1.99181957 0.13034307

8 Gen2HVL 0.30362755 5.95064910 0.02061090 -9.616e-6 1.99181957 0.13034307

9 Gen2HVL 0.25567534 6.13617861 0.02083078 -9.519e-6 1.99181957 0.13034307

10 Gen2HVL 0.18790095 6.28622363 0.02883085 0.00043404 1.99181957 0.13034307

When applying the convolution and subsequently fitting the unwound data, we are also giving up the improved resolution derived from the gradient. While the a₄ parameter fits very close to 2, the loss of resolution means that peak 11 must be fitted as a hidden peak where it was a straightforward matter to fit the Gen2HVL or Gen2NLC directly to the original data.

Note that a Gen2HVL or Gen2NLC fit to gradient data can uncover hidden peaks as evident by both a₂ and a₃ in this example.

HPLC Gradient Peaks - Fitting a Deconvolution Model