Computing clusters with systems with equal output #55

How should I compute the system ranking clusters when systems often produce the same output and are therefore merged in the results CSV file? Is the [`scripts/compute_ranking_clusters.perl`](https://github.com/cfedermann/Appraise/blob/master/scripts/compute_ranking_clusters.perl) script the correct way to do it?

The script seems to ignore merged systems in the results CSV file: `sysA+sysB` is treated as a separate, new system. I have fixed this [in this commit](https://github.com/tuetschek/Appraise/commit/15def149c9cc82635a8f37d846e7779c4a1f4984) in my fork. Was that the correct thing to do, or is there a better way of getting the ranking clusters?

(Without this fix, the clustering script would get stuck [in an infinite loop](https://github.com/cfedermann/Appraise/blob/master/scripts/compute_ranking_clusters.perl#L121) on my data, which consists of several variants of the same NLG system that often produce identical outputs.)
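
To make the idea concrete, here is a rough Python sketch of the kind of expansion I mean. It is not the code from the commit or from the Perl script; the function names, the dictionary layout, and the assumptions that system IDs never contain `+` themselves and that a lower rank is better are all mine, for illustration only.

```python
from itertools import combinations


def expand_merged(ranking, sep="+"):
    """Split merged IDs such as 'sysA+sysB' into individual systems.

    Each system in a merged entry inherits the rank of that entry.
    Assumes (hypothetically) that `sep` never occurs inside a real system ID.
    """
    expanded = {}
    for system_id, rank in ranking.items():
        for name in system_id.split(sep):
            expanded[name] = rank
    return expanded


def pairwise_outcomes(ranking):
    """Return (sys1, sys2, outcome) triples for every pair of systems.

    '<' means sys1 has the lower (assumed better) rank, '>' the higher,
    and '=' a tie.
    """
    outcomes = []
    for (a, rank_a), (b, rank_b) in combinations(sorted(ranking.items()), 2):
        if rank_a < rank_b:
            outcome = "<"
        elif rank_a > rank_b:
            outcome = ">"
        else:
            outcome = "="
        outcomes.append((a, b, outcome))
    return outcomes


# One judgment in which sysA and sysB produced identical output.
judgment = {"sysA+sysB": 1, "sysC": 2}
expanded = expand_merged(judgment)
pairs = pairwise_outcomes(expanded)
```

With this expansion, `sysA` and `sysB` each take part in the pairwise comparisons under their own names and tie with each other, rather than showing up as a third system called `sysA+sysB`.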

