SeeDev corpus

Corpus description


Item                                          #  Train   Dev  Test
Documents                                    20    90%   75%   80%
Paragraphs                                   87    45%   22%   33%
Words                                    44,857    45%   23%   33%
Total entities                            7,082    46%   23%   31%
Total n-ary relations (SeeDev full)       2,583    45%   23%   32%
Total binary relations (SeeDev binary)    3,575    46%   23%   32%


Distribution of relations in the Train, Dev and Test sets

Relation                              #  Train   Dev  Test  % of total
Where and When                      704    45%   23%   32%         20%
  Exists_At_Stage                    33    45%   24%   30%          1%
  Exists_In_Genotype                377    45%   21%   34%         11%
  Occurs_During                      30    27%   33%   40%          1%
  Occurs_In_Genotype                 48    38%   33%   29%          1%
  Is_Localized_In                   216    50%   22%   29%          6%
Function                            257    42%   28%   30%          7%
  Is_Involved_In_Process             55    42%   36%   22%          2%
  Transcribes_Or_Translates_To       54    46%   24%   30%          2%
  Is_Functionally_Equivalent_To     148    41%   26%   33%          4%
Regulation                        1,731    46%   22%   31%         48%
  Regulates_Accumulation             81    44%   36%   20%          2%
  Regulates_Development_Phase       242    44%   24%   32%          7%
  Regulates_Expression              450    45%   25%   31%         13%
  Regulates_Molecule_Activity        25    64%    0%   36%          1%
  Regulates_Process                 904    48%   20%   32%         25%
  Regulates_Tissue_Development       29    31%   31%   38%          1%
Composition and Membership          532    44%   22%   34%         15%
  Composes_Primary_Structure         51    39%   29%   31%          1%
  Composes_Protein_Complex           19    84%    0%   16%          1%
  Has_Sequence_Identical_To         126    49%   16%   35%          4%
  Is_Member_Of_Family               230    39%   24%   37%          6%
  Is_Protein_Domain_Of              106    43%   27%   29%          3%
Interaction                         264    46%   21%   33%          7%
  Interacts_With                    148    42%   22%   36%          4%
  Binds_To                          116    52%   21%   28%          3%
Specific to Binary Framework         87    51%   26%   23%          2%
  Is_Linked_To                       87    51%   26%   23%          2%
Total                             3,575    46%   23%   32%        100%
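
The last column of the relation distribution appears to give each relation's share of all 3,575 binary relations. The short Python sketch below is one way to check that reading: it recomputes the category-level shares from the counts listed above. The names CATEGORY_COUNTS and share_of_total are hypothetical and introduced only for illustration; they are not part of any SeeDev tooling.

```python
# Category-level counts of binary relations, copied from the distribution above.
# CATEGORY_COUNTS and share_of_total are illustrative names, not SeeDev tooling.
CATEGORY_COUNTS = {
    "Where and When": 704,
    "Function": 257,
    "Regulation": 1731,
    "Composition and Membership": 532,
    "Interaction": 264,
    "Specific to Binary Framework": 87,
}


def share_of_total(counts):
    """Return each category's share of all relations as a whole-number percentage."""
    total = sum(counts.values())
    return {name: round(100 * n / total) for name, n in counts.items()}


if __name__ == "__main__":
    print("total:", sum(CATEGORY_COUNTS.values()))
    for name, pct in share_of_total(CATEGORY_COUNTS).items():
        print(f"{name}: {pct}%")
```

Under this reading, the six category counts add up to the reported total, and the rounded shares agree with the category rows of the last column.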