Evaluating Dialogue Systems via an Opinion (doi:10.60507/FK2/FX37GD)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

(external link)

Document Description

Citation

Title:

Evaluating Dialogue Systems via an Opinion

Identification Number:

doi:10.60507/FK2/FX37GD

Distributor:

bonndata

Date of Distribution:

2023-06-05

Version:

1

Bibliographic Citation:

Nedelchev, Rostislav, 2023, "Evaluating Dialogue Systems via an Opinion", https://doi.org/10.60507/FK2/FX37GD, bonndata, V1

Study Description

Citation

Title:

Evaluating Dialogue Systems via an Opinion

Identification Number:

doi:10.60507/FK2/FX37GD

Authoring Entity:

Nedelchev, Rostislav

Software used in Production:

Python

Software used in Production:

virtual environment like (Ana|Mini)conda

Distributor:

bonndata

Access Authority:

Nedelchev, Rostislav

Depositor:

Nedelchev, Rostislav

Date of Deposit:

2023-05-20

Holdings Information:

https://doi.org/10.60507/FK2/FX37GD

Study Scope

Keywords:

Computer and Information Science

Abstract:

Dialogue systems are a significant field of research and development in artificial intelligence. Until today, the evaluation of such algorithms happens in one fundamental way. They solve "hypothetical problems," i.e., dialog systems are tested by being asked to respond in specific scenarios and provide a "solution", i.e. a reply, to a "problem". The replies are either explicitly compared to reference responses using overlap-based metrics (e.g., BLEU) or are evaluated by human annotators, which can also be seen as compared to (implicit) references. Instead, we propose to ask dialogue systems to tell whether a sample "solution" to a sample "problem" is good or bad. In other words, we ask another dialog system whether a conversation is fluent and coherent or not, and to what degree. In our experiments, we show how to evaluate dialogue systems by "asking for an opinion" and that it indeed offers an additional perspective on assessing these methods.

Methodology and Processing

Sources Statement

Data Access

Other Study Description Materials

Related Publications

Citation

Title:

Nedelchev, Rostislav: Automatic Evaluation of Dialogue-Systems Using Neural-Network Methods. - Bonn, 2023. - Dissertation, Rheinische Friedrich-Wilhelms-Universität Bonn. Online-Ausgabe in bonndoc: https://nbn-resolving.org/urn:nbn:de:hbz:5-70983

Identification Number:

urn:nbn:de:hbz:5-70983

Bibliographic Citation:

Nedelchev, Rostislav: Automatic Evaluation of Dialogue-Systems Using Neural-Network Methods. - Bonn, 2023. - Dissertation, Rheinische Friedrich-Wilhelms-Universität Bonn. Online-Ausgabe in bonndoc: https://nbn-resolving.org/urn:nbn:de:hbz:5-70983

Other Reference Note(s)

Parts of the code are based on the following work: Yookoon Park and Jaemin Cho and Gunhee Kim "A Hierarchical Latent Structure for Variational Conversation Modeling", 2018, NAACL, DOI: http://dx.doi.org/10.18653/v1/N18-1162

Original code is located here: https://github.com/ctr4si/A-Hierarchical-Latent-Structure-for-Variational-Conversation-Modeling

Other Study-Related Materials

Label:

convai1_preprocess.py

Notes:

text/x-python

Other Study-Related Materials

Label:

convai2_preprocess.py

Notes:

text/x-python

Other Study-Related Materials

Label:

cornell_preprocess.py

Notes:

text/x-python

Other Study-Related Materials

Label:

env.yml

Notes:

application/octet-stream

Other Study-Related Materials

Label:

Readme.md

Notes:

text/markdown

Other Study-Related Materials

Label:

ubuntu_preprocess.py

Notes:

text/x-python

Other Study-Related Materials

Label:

convai1.json

Notes:

application/json

Other Study-Related Materials

Label:

conversation_length.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

sentences.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

sentence_length.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

convai2.json

Notes:

application/json

Other Study-Related Materials

Label:

conversation_length.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

sentences.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

sentence_length.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

chameleons.pdf

Notes:

application/pdf

Other Study-Related Materials

Label:

id2word.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

movie_characters_metadata.txt

Notes:

text/plain

Other Study-Related Materials

Label:

movie_conversations.txt

Notes:

text/plain

Other Study-Related Materials

Label:

movie_lines.txt

Notes:

text/plain

Other Study-Related Materials

Label:

movie_titles_metadata.txt

Notes:

text/plain

Other Study-Related Materials

Label:

raw_script_urls.txt

Notes:

text/plain

Other Study-Related Materials

Label:

README.txt

Notes:

text/plain

Other Study-Related Materials

Label:

word2id.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

conversation_length.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

sentences.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

sentence_length.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

conversation_length.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

sentences.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

sentence_length.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

conversation_length.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

sentences.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

sentence_length.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

configs.py

Notes:

text/x-python

Other Study-Related Materials

Label:

convai1_results.pickle.bz2

Notes:

application/x-bzip2

Other Study-Related Materials

Label:

convai2_results.pickle.bz2

Notes:

application/x-bzip2

Other Study-Related Materials

Label:

correlations.ipynb

Notes:

application/x-ipynb+json

Other Study-Related Materials

Label:

data_loader.py

Notes:

text/x-python

Other Study-Related Materials

Label:

eval.py

Notes:

text/x-python

Other Study-Related Materials

Label:

eval_4models_convai.sh

Notes:

application/x-sh

Other Study-Related Materials

Label:

eval_convai.py

Notes:

text/x-python

Other Study-Related Materials

Label:

eval_embed.py

Notes:

text/x-python

Other Study-Related Materials

Label:

gpt_convai_eval.py

Notes:

text/x-python

Other Study-Related Materials

Label:

solver.py

Notes:

text/x-python

Other Study-Related Materials

Label:

sqlitedict_compress.py

Notes:

text/x-python

Other Study-Related Materials

Label:

train.py

Notes:

text/x-python

Other Study-Related Materials

Label:

train_4models.sh

Notes:

application/x-sh

Other Study-Related Materials

Label:

__init__.py

Notes:

text/x-python

Other Study-Related Materials

Label:

30.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

config.txt

Notes:

text/plain

Other Study-Related Materials

Label:

convai1.sqlite

Notes:

application/octet-stream

Other Study-Related Materials

Label:

convai2.sqlite

Notes:

application/octet-stream

Other Study-Related Materials

Label:

events.out.tfevents.1604760827.ff8fc1a53fe2

Notes:

application/octet-stream

Other Study-Related Materials

Label:

samples.txt

Notes:

text/plain

Other Study-Related Materials

Label:

30.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

config.txt

Notes:

text/plain

Other Study-Related Materials

Label:

convai1.sqlite

Notes:

application/octet-stream

Other Study-Related Materials

Label:

convai2.sqlite

Notes:

application/octet-stream

Other Study-Related Materials

Label:

events.out.tfevents.1604756310.ff8fc1a53fe2

Notes:

application/octet-stream

Other Study-Related Materials

Label:

samples.txt

Notes:

text/plain

Other Study-Related Materials

Label:

30.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

config.txt

Notes:

text/plain

Other Study-Related Materials

Label:

convai1.sqlite

Notes:

application/octet-stream

Other Study-Related Materials

Label:

convai2.sqlite

Notes:

application/octet-stream

Other Study-Related Materials

Label:

events.out.tfevents.1604773321.ff8fc1a53fe2

Notes:

application/octet-stream

Other Study-Related Materials

Label:

samples.txt

Notes:

text/plain

Other Study-Related Materials

Label:

30.pkl

Notes:

application/octet-stream

Other Study-Related Materials

Label:

config.txt

Notes:

text/plain

Other Study-Related Materials

Label:

convai1.sqlite

Notes:

application/octet-stream

Other Study-Related Materials

Label:

convai2.sqlite

Notes:

application/octet-stream

Other Study-Related Materials

Label:

events.out.tfevents.1604766709.ff8fc1a53fe2

Notes:

application/octet-stream

Other Study-Related Materials

Label:

samples.txt

Notes:

text/plain

Other Study-Related Materials

Label:

ae_mapper.py

Notes:

text/x-python

Other Study-Related Materials

Label:

hred.py

Notes:

text/x-python

Other Study-Related Materials

Label:

seq2seq.py

Notes:

text/x-python

Other Study-Related Materials

Label:

vhcr.py

Notes:

text/x-python

Other Study-Related Materials

Label:

vhred.py

Notes:

text/x-python

Other Study-Related Materials

Label:

__init__.py

Notes:

text/x-python

Other Study-Related Materials

Label:

beam_search.py

Notes:

text/x-python

Other Study-Related Materials

Label:

decoder.py

Notes:

text/x-python

Other Study-Related Materials

Label:

encoder.py

Notes:

text/x-python

Other Study-Related Materials

Label:

feedforward.py

Notes:

text/x-python

Other Study-Related Materials

Label:

loss.py

Notes:

text/x-python

Other Study-Related Materials

Label:

rnncells.py

Notes:

text/x-python

Other Study-Related Materials

Label:

__init__.py

Notes:

text/x-python

Other Study-Related Materials

Label:

bow.py

Notes:

text/x-python

Other Study-Related Materials

Label:

convert.py

Notes:

text/x-python

Other Study-Related Materials

Label:

embedding_metric.py

Notes:

text/x-python

Other Study-Related Materials

Label:

mask.py

Notes:

text/x-python

Other Study-Related Materials

Label:

pad.py

Notes:

text/x-python

Other Study-Related Materials

Label:

probability.py

Notes:

text/x-python

Other Study-Related Materials

Label:

tensorboard.py

Notes:

text/x-python

Other Study-Related Materials

Label:

time_track.py

Notes:

text/x-python

Other Study-Related Materials

Label:

tokenizer.py

Notes:

text/x-python

Other Study-Related Materials

Label:

vocab.py

Notes:

text/x-python

Other Study-Related Materials

Label:

__init__.py

Notes:

text/x-python