Part 1: Document Description
|
Citation |
|
---|---|
Title: |
Evaluating Dialogue Systems via an Opinion |
Identification Number: |
doi:10.60507/FK2/FX37GD |
Distributor: |
bonndata |
Date of Distribution: |
2023-06-05 |
Version: |
1 |
Bibliographic Citation: |
Nedelchev, Rostislav, 2023, "Evaluating Dialogue Systems via an Opinion", https://doi.org/10.60507/FK2/FX37GD, bonndata, V1 |
Citation |
|
Title: |
Evaluating Dialogue Systems via an Opinion |
Identification Number: |
doi:10.60507/FK2/FX37GD |
Authoring Entity: |
Nedelchev, Rostislav |
Software used in Production: |
Python |
Software used in Production: |
A virtual environment manager such as Anaconda or Miniconda |
Distributor: |
bonndata |
Access Authority: |
Nedelchev, Rostislav |
Depositor: |
Nedelchev, Rostislav |
Date of Deposit: |
2023-05-20 |
Holdings Information: |
https://doi.org/10.60507/FK2/FX37GD |
Study Scope |
|
Keywords: |
Computer and Information Science |
Abstract: |
Dialogue systems are a significant field of research and development in artificial intelligence. To date, such systems have been evaluated in one fundamental way: they solve "hypothetical problems," i.e., they are asked to respond in specific scenarios and provide a "solution" (a reply) to a "problem." The replies are either compared explicitly to reference responses using overlap-based metrics (e.g., BLEU) or evaluated by human annotators, which can also be seen as a comparison against (implicit) references. Instead, we propose asking dialogue systems to judge whether a sample "solution" to a sample "problem" is good or bad. In other words, we ask another dialogue system whether a conversation is fluent and coherent, and to what degree. Our experiments show how to evaluate dialogue systems by "asking for an opinion" and that this approach offers an additional perspective on assessing these methods. |
Methodology and Processing |
|
Sources Statement |
|
Data Access |
|
Other Study Description Materials |
|
Related Publications |
|
Citation |
|
Title: |
Nedelchev, Rostislav: Automatic Evaluation of Dialogue-Systems Using Neural-Network Methods. - Bonn, 2023. - Dissertation, Rheinische Friedrich-Wilhelms-Universität Bonn. Online edition in bonndoc: https://nbn-resolving.org/urn:nbn:de:hbz:5-70983 |
Identification Number: |
urn:nbn:de:hbz:5-70983 |
Bibliographic Citation: |
Nedelchev, Rostislav: Automatic Evaluation of Dialogue-Systems Using Neural-Network Methods. - Bonn, 2023. - Dissertation, Rheinische Friedrich-Wilhelms-Universität Bonn. Online edition in bonndoc: https://nbn-resolving.org/urn:nbn:de:hbz:5-70983 |
Other Reference Note(s) |
|
Parts of the code are based on the following work: Yookoon Park, Jaemin Cho, and Gunhee Kim, "A Hierarchical Latent Structure for Variational Conversation Modeling," NAACL, 2018, DOI: http://dx.doi.org/10.18653/v1/N18-1162 |
|
The original code is available at: https://github.com/ctr4si/A-Hierarchical-Latent-Structure-for-Variational-Conversation-Modeling |
|
Label: |
convai1_preprocess.py |
Notes: |
text/x-python |
Label: |
convai2_preprocess.py |
Notes: |
text/x-python |
Label: |
cornell_preprocess.py |
Notes: |
text/x-python |
Label: |
env.yml |
Notes: |
application/octet-stream |
Label: |
Readme.md |
Notes: |
text/markdown |
Label: |
ubuntu_preprocess.py |
Notes: |
text/x-python |
Label: |
convai1.json |
Notes: |
application/json |
Label: |
conversation_length.pkl |
Notes: |
application/octet-stream |
Label: |
sentences.pkl |
Notes: |
application/octet-stream |
Label: |
sentence_length.pkl |
Notes: |
application/octet-stream |
Label: |
convai2.json |
Notes: |
application/json |
Label: |
conversation_length.pkl |
Notes: |
application/octet-stream |
Label: |
sentences.pkl |
Notes: |
application/octet-stream |
Label: |
sentence_length.pkl |
Notes: |
application/octet-stream |
Label: |
chameleons.pdf |
Notes: |
application/pdf |
Label: |
id2word.pkl |
Notes: |
application/octet-stream |
Label: |
movie_characters_metadata.txt |
Notes: |
text/plain |
Label: |
movie_conversations.txt |
Notes: |
text/plain |
Label: |
movie_lines.txt |
Notes: |
text/plain |
Label: |
movie_titles_metadata.txt |
Notes: |
text/plain |
Label: |
raw_script_urls.txt |
Notes: |
text/plain |
Label: |
README.txt |
Notes: |
text/plain |
Label: |
word2id.pkl |
Notes: |
application/octet-stream |
Label: |
conversation_length.pkl |
Notes: |
application/octet-stream |
Label: |
sentences.pkl |
Notes: |
application/octet-stream |
Label: |
sentence_length.pkl |
Notes: |
application/octet-stream |
Label: |
conversation_length.pkl |
Notes: |
application/octet-stream |
Label: |
sentences.pkl |
Notes: |
application/octet-stream |
Label: |
sentence_length.pkl |
Notes: |
application/octet-stream |
Label: |
conversation_length.pkl |
Notes: |
application/octet-stream |
Label: |
sentences.pkl |
Notes: |
application/octet-stream |
Label: |
sentence_length.pkl |
Notes: |
application/octet-stream |
Label: |
configs.py |
Notes: |
text/x-python |
Label: |
convai1_results.pickle.bz2 |
Notes: |
application/x-bzip2 |
Label: |
convai2_results.pickle.bz2 |
Notes: |
application/x-bzip2 |
Label: |
correlations.ipynb |
Notes: |
application/x-ipynb+json |
Label: |
data_loader.py |
Notes: |
text/x-python |
Label: |
eval.py |
Notes: |
text/x-python |
Label: |
eval_4models_convai.sh |
Notes: |
application/x-sh |
Label: |
eval_convai.py |
Notes: |
text/x-python |
Label: |
eval_embed.py |
Notes: |
text/x-python |
Label: |
gpt_convai_eval.py |
Notes: |
text/x-python |
Label: |
solver.py |
Notes: |
text/x-python |
Label: |
sqlitedict_compress.py |
Notes: |
text/x-python |
Label: |
train.py |
Notes: |
text/x-python |
Label: |
train_4models.sh |
Notes: |
application/x-sh |
Label: |
__init__.py |
Notes: |
text/x-python |
Label: |
30.pkl |
Notes: |
application/octet-stream |
Label: |
config.txt |
Notes: |
text/plain |
Label: |
convai1.sqlite |
Notes: |
application/octet-stream |
Label: |
convai2.sqlite |
Notes: |
application/octet-stream |
Label: |
events.out.tfevents.1604760827.ff8fc1a53fe2 |
Notes: |
application/octet-stream |
Label: |
samples.txt |
Notes: |
text/plain |
Label: |
30.pkl |
Notes: |
application/octet-stream |
Label: |
config.txt |
Notes: |
text/plain |
Label: |
convai1.sqlite |
Notes: |
application/octet-stream |
Label: |
convai2.sqlite |
Notes: |
application/octet-stream |
Label: |
events.out.tfevents.1604756310.ff8fc1a53fe2 |
Notes: |
application/octet-stream |
Label: |
samples.txt |
Notes: |
text/plain |
Label: |
30.pkl |
Notes: |
application/octet-stream |
Label: |
config.txt |
Notes: |
text/plain |
Label: |
convai1.sqlite |
Notes: |
application/octet-stream |
Label: |
convai2.sqlite |
Notes: |
application/octet-stream |
Label: |
events.out.tfevents.1604773321.ff8fc1a53fe2 |
Notes: |
application/octet-stream |
Label: |
samples.txt |
Notes: |
text/plain |
Label: |
30.pkl |
Notes: |
application/octet-stream |
Label: |
config.txt |
Notes: |
text/plain |
Label: |
convai1.sqlite |
Notes: |
application/octet-stream |
Label: |
convai2.sqlite |
Notes: |
application/octet-stream |
Label: |
events.out.tfevents.1604766709.ff8fc1a53fe2 |
Notes: |
application/octet-stream |
Label: |
samples.txt |
Notes: |
text/plain |
Label: |
ae_mapper.py |
Notes: |
text/x-python |
Label: |
hred.py |
Notes: |
text/x-python |
Label: |
seq2seq.py |
Notes: |
text/x-python |
Label: |
vhcr.py |
Notes: |
text/x-python |
Label: |
vhred.py |
Notes: |
text/x-python |
Label: |
__init__.py |
Notes: |
text/x-python |
Label: |
beam_search.py |
Notes: |
text/x-python |
Label: |
decoder.py |
Notes: |
text/x-python |
Label: |
encoder.py |
Notes: |
text/x-python |
Label: |
feedforward.py |
Notes: |
text/x-python |
Label: |
loss.py |
Notes: |
text/x-python |
Label: |
rnncells.py |
Notes: |
text/x-python |
Label: |
__init__.py |
Notes: |
text/x-python |
Label: |
bow.py |
Notes: |
text/x-python |
Label: |
convert.py |
Notes: |
text/x-python |
Label: |
embedding_metric.py |
Notes: |
text/x-python |
Label: |
mask.py |
Notes: |
text/x-python |
Label: |
pad.py |
Notes: |
text/x-python |
Label: |
probability.py |
Notes: |
text/x-python |
Label: |
tensorboard.py |
Notes: |
text/x-python |
Label: |
time_track.py |
Notes: |
text/x-python |
Label: |
tokenizer.py |
Notes: |
text/x-python |
Label: |
vocab.py |
Notes: |
text/x-python |
Label: |
__init__.py |
Notes: |
text/x-python |