Implemented option to plot pulls #219

TimLdl · 2023-06-06T10:11:07Z

No description provided.

cverstege

I have some general questions and comments about this implementation.

cverstege · 2023-06-06T11:47:14Z

kafe2/fit/xy/plot.py

+        if self._fit.did_fit and (
+                self._fit.has_errors or not self._fit._cost_function.needs_errors):
+            _band_y = self.y_error_band
+            return target_axes.fill_between(
+                self.model_line_x,
+                -_band_y, _band_y,
+                **kwargs)
+        return None  # don't plot error band if fitter input data has no errors...


This is the exact same code as plot_pull_error_band below. Shouldn't the pull error band be normed to one?

cverstege · 2023-06-06T11:50:27Z

kafe2/fit/histogram/plot.py

Formatting changes should be in a separate commit or can be omitted and then be done as a full codebase refactor. As far as I can tell, the changes in this file are formatting only.

cverstege · 2023-06-06T11:50:39Z

kafe2/fit/indexed/plot.py

Formatting changes should be in a separate commit or can be omitted and then be done as a full codebase refactor. As far as I can tell, the changes in this file are formatting only.

cverstege · 2023-06-06T11:52:08Z

kafe2/fit/unbinned/plot.py

+    def plot_pull(self, target_axes, error_contributions=('data',), **kwargs):
+        raise TypeError("Pull cannot be plotted for unbinned fits.")
+


Why is this function implemented here and not in the other PlotAdapters? Will it ever be called? I don't think so, because it inherits from base and not from XYPlotAdapter.
Correct me if I'm wrong.

The argument for pull plots is added to the base class. Without this addition you get The following error when trying to do an unbinned pull plot:

Traceback (most recent call last): File "/home/johannesg/Projects/kafe2/examples/010_unbinned_fit/qr_skwjYt.py", line 72, in <module> plot.plot(fit_info=True, asymmetric_parameter_errors=True, pull=True) # plot the data and the fit File "/home/johannesg/Projects/kafe2/kafe2/fit/_base/plot.py", line 1380, in plot _plot_results = self._plot_and_get_results( File "/home/johannesg/Projects/kafe2/kafe2/fit/_base/plot.py", line 872, in _plot_and_get_results _artist = _pdc.call_plot_method(_pt, File "/home/johannesg/Projects/kafe2/kafe2/fit/_base/plot.py", line 389, in call_plot_method return _callable( File "/home/johannesg/Projects/kafe2/kafe2/fit/_base/plot.py", line 685, in plot_pull (self.data_y - self.model_y) / self._get_total_error(error_contributions), File "/home/johannesg/Projects/kafe2/kafe2/fit/unbinned/plot.py", line 59, in data_y raise TypeError("There's no y-data in the unbinned container") TypeError: There's no y-data in the unbinned container```

cverstege · 2023-06-06T11:53:42Z

examples/006_advanced_errors/03_relative_uncertainties.py

I think we shouln't remove code from existing examples. But we can add of course. If this is for testing only, then please remove it from the commit.
There should be an example showcasing the pulls of course.

cverstege · 2023-06-06T11:57:11Z

kafe2/fit/_base/plot.py

@@ -659,6 +669,25 @@ def plot_residual(self, target_axes, error_contributions=('data',), **kwargs):
            **kwargs
        )

+    def plot_pull(self, target_axes, error_contributions=('data',), **kwargs):


I think the default for the shift (y location) should be the model error contribution, and the default for the y_err of the errorbar should be the data error.

cverstege · 2023-06-06T11:59:35Z

kafe2/fit/_base/plot.py

@@ -1398,6 +1439,27 @@ def plot(self, legend=True, fit_info=True, asymmetric_parameter_errors=False,
                    else:
                        _axis.set_ylim(residual_range)

+                    if pull:
+                        _axis = self._current_axes['pull']
+                        _pull_label = kc('fit', 'plot', 'pull_label')


There is no default value implemented. This needs to be added here

kafe2/kafe2/config/kafe2.yaml

Line 32 in d904e68

residual_label: 'Residual'

Same as the residual label.

GuenterQuast · 2023-06-08T10:08:00Z

Hallo Zusammen,

ich habe den Pull-Plot angeschaut und war auch etwas verwirrt.
Fehlerbalken oder Unsicherheitsband ergeben keinen Sinn, weil sie eine andere Skala benötigen
als die Pull-Größe, die in Einheiten der Unsicherheit gemessen wird.

Der Vorteil eines Echten Pull-Plots wären Einfachheit und Klarheit:
lediglich die Abweichung eines jeden Punktes von Null muss gezeigt werden.
Dazu reicht ein Balkendiagramm; man kann auch die 1- und 2- Sigmabereiche
z.b. grün und gelb einfärben (s. z.B. https://pyhf.github.io/pyhf-tutorial/PullPlot.html).

Grüße,
Günter

JohannesGaessler · 2023-06-08T15:49:07Z

When you make a PR please write at least one short sentence that describes the changes for bookkeeping. Also attaching a plot would be useful. This is the plot that I got when running the example that you modified:

I didn't look at the code yet but I can already tell that there is an issue. The label for the pull plot is the same as for the regular plot but that is incorrect since the values shown there are not the same. I think the label should simply be "Pull" in equivalence to ratio plots where it's simply "Ratio".

JohannesGaessler

As Cedric said, please don't reformat unrelated parts of the code when you make functional changes. It makes it more difficult for us to review your changes. Using my IDE I can filter these changes out but on Github you can only do this partially by hiding whitespace changes and those can have actual functional implications for Python so it's not ideal.

JohannesGaessler · 2023-06-08T15:57:05Z

kafe2/fit/_base/plot.py

+
+        :param matplotlib.axes.Axes target_axes: The :py:obj:`matplotlib` axes used for plotting.
+        :param error_contributions: Which error contributions to include when plotting the data.
+            Can either be ``data``, ``'model'`` or both.


The use of quotation marks is inconsistent here. Since it seems to be the result of copy-pasting, please fix it for the other methods as well.

JohannesGaessler · 2023-06-08T16:04:34Z

kafe2/fit/_base/plot.py

-             plot_width_share=0.5, font_scale=1.0, figsize=None):
+             pull=False, pull_range=None, pull_height_share=0.25,
+             plot_width_share=0.5, figsize=None,
+             font_scale=1.0):


What is your reason for changing the order of figsize and font_scale?

cverstege · 2023-06-09T09:32:25Z

ich habe den Pull-Plot angeschaut und war auch etwas verwirrt. Fehlerbalken oder Unsicherheitsband ergeben keinen Sinn, weil sie eine andere Skala benötigen als die Pull-Größe, die in Einheiten der Unsicherheit gemessen wird.

Genau das habe ich mit #219 (comment) gemeint. Das Fehlerband ist in einem Pull-Plot der Definition nach 1. Die Datenpunkte sind die Differenz der Daten mit dem Fit normiert mit der Unsicherheit des Fits (pull = (fit-data)/sigma_fit). Das hat zur Folge, dass das Fit-Fehlerband im Pull Plot dann bei +/-1 sein muss. Die Fehlerbalken der Datenpunkte werden mit der Unsicherheit des Fits skaliert (sigma_pull = sigma_data/sigma_fit). Einfache Fehlerfortpflanzung der Pull-Definition.

Implemented option to plot pulls

47fb1dc

cverstege self-requested a review June 6, 2023 11:41

cverstege reviewed Jun 6, 2023

View reviewed changes

JohannesGaessler reviewed Jun 8, 2023

View reviewed changes

TimLdl marked this pull request as draft June 30, 2023 16:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implemented option to plot pulls #219

Implemented option to plot pulls #219

TimLdl commented Jun 6, 2023

cverstege left a comment

cverstege Jun 6, 2023

cverstege Jun 6, 2023

cverstege Jun 6, 2023

cverstege Jun 6, 2023

JohannesGaessler Jun 8, 2023

cverstege Jun 6, 2023

cverstege Jun 6, 2023

cverstege Jun 6, 2023

GuenterQuast commented Jun 8, 2023

JohannesGaessler commented Jun 8, 2023

JohannesGaessler left a comment

JohannesGaessler Jun 8, 2023

JohannesGaessler Jun 8, 2023

cverstege commented Jun 9, 2023 •

edited

Loading

		def plot_pull(self, target_axes, error_contributions=('data',), **kwargs):
		raise TypeError("Pull cannot be plotted for unbinned fits.")

Implemented option to plot pulls #219

Are you sure you want to change the base?

Implemented option to plot pulls #219

Conversation

TimLdl commented Jun 6, 2023

cverstege left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

GuenterQuast commented Jun 8, 2023

JohannesGaessler commented Jun 8, 2023

JohannesGaessler left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cverstege commented Jun 9, 2023 • edited Loading

cverstege commented Jun 9, 2023 •

edited

Loading