cc COMPUTERISED OUTPUT OF PHONI TIC CODIS IN ...

19
B.A.R.C-135Q cc 35 O CO 05 COMPUTERISED OUTPUT OF PHONI TIC CODIS IN DKVANAGARI SCRIPT MY DOT-MATRiX PRINTI RS hy S. Somiisundarain. M. M. K. Suri ;md R. Kh.cua Health Phvsiv.s Division 1987

Transcript of cc COMPUTERISED OUTPUT OF PHONI TIC CODIS IN ...

B.A.R.C-135Q

cc

35O

CO

05

COMPUTERISED OUTPUT OF PHONI TIC CODIS INDKVANAGARI SCRIPT MY DOT-MATRiX PRINTI RS

hy

S. Somiisundarain. M. M. K. Suri ;md R. Kh.cuaHealth Phvsiv.s Division

1987

B.A.R.C. - 1358

GOVERNMENT OF INDIAATOMIC ENERGY COMMISSION

CO

f\< ^

I

Ua!

CO

COMPUTERISED OUTPUT OF PHONETIC CODES IN

DEVANAGARI SCRIPT BY DOT-MATRIX PRINTERS

by

S. Somasundaram, M.M.K. Suri and R. KhatuaHealth Physics Division

BHABHA ATOMIC RESEARCH CENTREBOMBAY, INDIA

1987

BARC - 1358

INIS Subject Category : F51.00

Descriptors

IDENTIFICATION SYSTEMS

COMPUTER CODES

INDIA

RADIATION DOSES

PERSONNEL

ABSTRAT

This report describes the development of a compviter software

for converting hex-octal, alpha-numeric and pure-alpha mode input in

English into 'phonetic Devanagari characters', which can be printed

through dot-matrix printers in 2 passes of print-head, along with

English text in the same lines.

If multilingual terminals presently available in India, are

used, it requires 4 passes of print-head for printing phonetic

Devanagari characters, and English text- also is converted into

phonetic Devanagari script during printing.

Thus, the software reported in this, is an improvement over

the facilities currently available in Indian market.

Computerised Output of Phonetic Codes in

Devanagari Script by Dot-matrix Printers

1. INTRODUCTION

There is a proposal [1] to allot pernanent 4-digit self-

checking unique identification phonetic code numbers to radiation

workers in India. The main characteristics of this numbering system

are:

(1) Phonetic codes will be constructed using 128

phonetic characters (Table 1) made with sixteen

consonant sounds (Table 2) and 8 vowel sounds

(Table 3) [2] taken from Indian languages.

(2) Each character of a phonetic code (consisting of

a consonant followed by a vowel) should occupy

only one byte of computer memory (Table 4);

(3) The 16 consonants used in phonetic codes will be

entered into the computer using hexadecimal code,

i.e., by 0 through 9 and A through F (Table 5);

(4) The 8 vowels used in phonetic codes will be input

into the computer using octal codes, that is, by

0 through 7 ( Table 6);

(5) The hexadecimal part, corresponding to consonant,

will occupy 4 bits of a byte (Table 4);

(6) The octal part, corresponding to the vowel, will

occupy the next 3 bits (5th to 7th) (Table 4).

In 1905, there were about 32,400 radiation workers in India

using personnel monitoring thermoluminescent and/or film badges from

Division of Radiological Protection, BARC and this number is

increasing at the rate of nearly 1000 per year during the past

several years [1]. Thus, if one phonetic character (consisting of

one consonant and one vowel) can be accommodated in one byte

(instead of two bytes — one for the consonant and the other for

vowel), there will be savings of nearly 13OK bytes of conputer memory

for storing the permanent phonetic code numbers of 32,400 radiation

workers in India.

2. 00R REQUIREMENTS

The input regarding the "Phonetic Code" can be, at present,

done in English. 16 soft consonant sounds taken from Indian

languages are phonetic sounds of K ( eft ), G (as in 'Qun' - J] ), £h.

(as in 'Chalk1 - =£[ ), £hH (as in fibhad - # ), J ( 5f ), 'I' V s in1 Istanbul1 - cf ), 'Th' (as in 'Thorn' - 3J ), p. (as "in 'Su&in' -

^ ) , N ( ?f^), P ( q- ), B ( § ), N { ff ), R ( ̂ ), L ( 51 ),

S' (as in 'Sun' - ?f ) and 'H' (as in 'Horse' - S S'S'

The 8 vowels sounds taken from Indian languages are on (as

in 'Bought' - 3f ), & (as in 'car' - 3̂ J ), 'I' (as in 'six' -

<J ), 2 (as in 'put' - 3 J, £ (as in 'thfin1 - n ), §i (as in

'Bairam Khan' - n ), 2 (as in 'so' 3f[ ) and aji (as in '

Arabia' - $

1. DIFFICULTIES IN ENTERING PHONETIC CODES

It is clear from the previous discussions that some of the

16 consonant sounds and the 8 vowel sounds of Indian languages, do

not have single-character English alphabetic letters. And if one has

to depress more than one key for a single consonant or vowel sound,

it would take longer tine for entering the data, in addition to the

accompanying chances of committing more mistakes, than depressing

one key for one sound (consonant or vowel).

With the help of hexa-decimal code, each of the above-

mentioned 16 consonant sounds of Indian languages, can be entered by

single depression of corresponding keys (Table 5). Similarly, each

of the 8 vowel sounds also can be entered by single depression of

single keys (Table 6).

4. REQUIREMENTS FOR PRINTING INDIAN SCRIPTS

Characters of Indian language scripts require much more

height for printing than those in English,

(1) partly because of 'Upper1 and 'Lower' annotations

for denoting the accompanying vowel sounds,

and

(2) partly because of a number of bends and several

horizontal parts in some of the consonants of the

Indian languages.

5. LIMITATIONS OF EXISTING PRINTERS

Since the majority of line-printers or dot-matrix printers

available in Indian market, are developed and manufactured in the

Western countries, they are equipped with interphases for printing

English letters in general, and Greek letters in a few.

Moreover, line printers are provided with fixed character

sets and, as such, no additional characters can be printed through

them.

6. EXISTING SOFTWARES

Some Indian companies have developed Multi-lingual Terminals

(MLT) cap?Jle of handling Indian language scripts. One of them was

requested to demonstrate the MLT.

It was found that the MLTs available in Indian market can

print Oevanagari script using dot-matrix printers, with the

following constraints:

(1) it requires 4 consecutive lines to print every

line of Devanagari script (i.e., in 4-pass of

print-head -- two lines for printing consonants,

one line for upper annotations for accompanying

vowels, and one line for lower ones);

(2) when MLT prints Oevanagari script, it outputs

everything (including English text also) in the

Devanagari script only.

Hence, they cannot serve our purpose, as we require to print,

only phonetic codes in Devanagari script and all other information

in English on the same lines.

7. OUR SOFTWARE

With the help of micro-computers, a program in FORTRAN has

been developed.

At the start of execution of this program, the user has the

option to select any one of the following modes for input:

(1) hex-octal codes (Table 5 and Table 6),

(2) alpha-numeric mode (Table 7), or

(3) pure-alpha node (Tab^e 8).

After the user enters the reply to the query as to the mode

of input, the computer will enquire about the number of phonetic

characters in each code. Upon getting the reply to this query, the

computer will get ready to accept the input for the required

phonetic codes in the specified mode only. Any wrong entry will be

ignored and the computer will request the user to feed the correct

data again.

When the input is made in the correct mode and in the

permissible combinations, the program will read pairs of the

specified character-codes, store each pair in one byte of computer

memory (Table 4), convert them one by one into the corresponding

Phonetic codes and print these codes in Devanagari script in two

consecutive lines, (that is in two-passes of the print head,) using

a dot-matrix printer.

A sample of phonetic characters printed by a dot-matrix

printer, is shown in Table 9.

8. CONCLUSION

Our computer software is an inprovement over the existing

facilities presently available in the market in India.

9. ACKNOWLEDGEMENT

-. The authors are grateful to Dr. D.N. Srivastava, Electronics

Systems Section, and Shri S.J. Choitramani, Health Physics Division,

for discussions.

They are also thankful to Shri S.D. Soman, Associate

Director, Radiological Group, and Shri T. Subbaratnam, Head, Health

Physics Division, for their keen interest in this work.

References:

1. R. Khatua, D. N. Srivastava, and S. Somasundaram, "Permanent

Phonetic Identification Code for Radiation Workers", BARC-T'7

(1987).

2. D. N. Srivastava,"Self-checking All purpose Phonetic Identity

Number System for Internal Security and Welfare", BARC-1287 (1985).

Table 1. 128 Phonetic Characters made by 16 consonants & 8 vov/els

T>

W

ft!

*

ift

ft

ft,

n.

if

nt

srt

0

1

2

•TA 3

4

5

6

7

8

9

i"r 1 0

1 1112

13

14

15

* 16

T̂T 17

ft, * 18

1- 1 19* 20

* 21

* 22

* 23

e 24

5?T 2 5

fe. 5?r 2 6

1. 1 27

B- 28

•v *» v

sr 30

& 31

ft.

sft

Tft

ft,

?•

err

eft

32

33

*r 34

^ 35

36

37

38

39

40

41

T"r 42

5 43

44

45

46

47

v 48

«rr 49

ft, *fr 50

g. v 51

*r 52

* 53

«r 54

«rt 55

? 56

*r 57

ft, ft 58

I- % 59

tr 60

f 61

ij 62

tf 63

ft,

3.

sft

T

Tt

H .

s.IT

•ft

64

65

*ft 66

1 67

68

69

70

71

72

73

"ft 74

1 75

76

77

78

79

ft,

4

9"T

?1

JT

TT

ft.

•J.if

*T

80

81

aft 82

« 83

84

85

S6

87

88

S9

«ft 90

* ^192

93

94

95

X

ft,

\,

\

X

ft

^t

f-,

?̂.

96

97

ft 98

I 99

100

101

102

103

104

105

?fr 106

« 107

108

109

110

111

T

ft, m

a1!

ff. ?J

5- S

1

112

113

114

115

116

117

118

119

no121

122

123

124

125

126

127

Table 2. 16 Consonants usad in Phonetic Characters

Table

31

3 .

m

8 Vowels

(f. 0

used

(*.

i n

« )

Hionetic

1 *

Characters

aft aft

Table 4. Binary Representation of 128 Rionetic Characters

Hex Code fortconsonant *

0000

0001

0010

0011

0100

0101

OHO

0111

1000

1001

1010

1011

1100

1101

1110

111!

Octal Codefor vowel

I"

000

3T0000000

0001000

0010000

0011000

0100000

0101000

on ooooon IOOO

1000000

1001000

1010000

1011000

1100000

1101000

111 0000

1111000

001

37T

0000001

0001001

0010001

0011001

0100001

0101001

0110001

oi nooi1 1000001

1001001

101000)

1011001

1100001

1 101001

1 110001

m ioo!

OiC

0000010

0001010

0010010

0011010

0100010

0101010

onooio

01 I 1010

1000010

1001010

1010010

lonoio

1100010

1101010

mooioi moio

on

00000I

000101

001001

00110]

OiOOOl

010101

011001

OHIO!

I00001

10010!

101001

101101

110001

110101

111001

111101

1

1

1

1

1

1

1

1

1

1

1

1

1

1

1

1

100

<?.*•:=0000100

0001100

0010100

0011100

0100100

0101100

0110100

on noo1000100

1001100

1010100

1011100

nooioo

1101100

1iiulOO

i11!100

101

j) £ •

0000101

0001101

0010101

0011101

0100101

0101101

onoioi

0111101

1000101

loonoi

1010101

10J110J

nooioi

1101101

njoioi

11J1101

no

0000110

oooi no

0010110

0011110

0100110

0101110

0110110

oin nolooono

1001110

1010110

J0J1110

nooiro

1101110

1110110

inino

Ml

0000111

0001111

0010111

0011111

03001 11

0101111

0110111

omm1000111

IOOI m

1010111

10JU1J

1200111

1101111

1110111

1111111

Tabl* 5* Hexadecimal representation »f 16 c*na*uants

C«n»«nant

HexadeciaalC*de

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Binaryrepresentation

0000

0001

0010

001 1

0|00

0|O|

0||0

oni

1000

1001

1010

1011 "'

II00

1 | 0 |

mo1 I I I

Table 0. Octal representation «f 8 vvwels

V«rel

3f

3fT

%,43,3.

atat

OctalC«de

0

1

2

3

4

5

6

7

Binaryrepresentati*D

000

001

010

onI 00

101

noin

T a b l e 7 . '••'^rriN^ PMO:."-:TIC NUMSRAT.S IN ENGLISH (PURE ALPHA MODE)

Phonecic

KZ

!%I

K'J

f

:<Y

:<o

:•:.>:

." t«O ^

GA

GI

GU

GE

GY

GC

GW

Dec.

0

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

Phonetic

C.

CA

Cl

CJ

CE

CY

CO

cv

QZ

QA

OY

QO

qw

Dec.

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

3!

I'honccic

37.

JA

JI

. ; • . :

JE

JY

JO

TZ

TA

TI

i V

TE

1 TY

TO

TV

Dec.

32

33

IU

Vj

36

37

38

39

40

41

42

43

A4

45

46

47

Phonecic

FZ

FA

FI

pv

FE

• FY

FO

FW

D2

DA

DI

,DL-

DE

DY

DO

DW

Dec

48

49

50

51

52

53

54

55

56

57

58

59

60

61

62

63

Phonetic

N:-:

HA

HI

MU

NE

NY

NO

?Z

PA

PI

FU

Pt

PY

PO

PW

Dec.

64

65

66

67

68

69

7C

71

72

73

74

75

76

77

7H

79

Phoni-r ic

3 A

'•: t

• : • -

:>£

3Y

30

SV

M2

MA

MI

MU

ME

MY

MO

MW

Dec.

80

8!

82

83

84

85

86

87

88

89

90

91

92

93

94

95

Phonetic

RZ

RA

urR-"

sv

KO

sw

• L Z

LA

LI

[.::

LH

LY

LO

LvJ

Dec,

96

97

93

99

100

10!

102

103

104

105

106

107

108

109

no

It

Phonetic

SZ

SA

SI

SU

SE

SY

SO

5y

HZ

HA

HI

HU

HE

HY

HO"

. _.—.--.„

Dec- j

l!2

113

P.4

ISS

126

117

118

119

120

121

122

123

124

125

126

127

T a b l e 6 . WRITING PHONETIC NUMERALS IN LNC-LISW (ALPHA-NUMERIC MODE)

Phonecic

KO

Kl

K2

K3

KA

*5

K6

K7

GO

GI

G2

G3

G<i

G5

C6

G7

Dec.

0

1

2

3 '

A

5

6

7

8

9

10

11

12

13

14

15

Phonetic

CO

Ci

C2

C3

CA

C5

C6

C7

QO

Ql

Q2

Q3

Q5

Q6

Q?

Doc.

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

Phonetic

JO

Jl

J2

J3

JA

J5

J6

J7

TO

Tl

T3

TA

T5

T6

T7

Dec.

32

33

34

35

36

37

38

39

40

41

42

A3

44

45

46

47

Phon.2i.ic

FO

F !

F2

?3

FA

?5

F6

F7

DO

Dl

32

03

DA

D5

D6

07

Dec.

48

49

50

51

52

53

54

55

56

57

58

59

60

61

62

63

Phonecic

HO

HI

V2

N3

NA

N3

N6

N7

PO

?1

?:

?3

PA

P5

P6

?7

Dec.

64

65

66

67

68

o9

70

71

72

73

74

75

76

77

78

79

Phonetic

BO

SI

3?

B3

BA

35

36

37

MO

Ml

M2

M3

MA

M5

M6

M7

Dec.

80

81

62

83

84

85

86

87

£3

89

90

91

92

93

94

55

Phonecic

RO

R!

R2

R3

RA

R5

R6

R7

LO

H

L2

L3

LA

L5

L6

L7

Dec.

96

97

98

99

100

101

102

103

104

105

106

107

SOS

109

HO

11

Phonetic

SO

SI

S2

S3

SA

S3

S6

S?

HO

HI

K2

K3

HA

H5

H6

K7

,« .

112

113

114

115

116

117

118

119

120

121

122

123

J24

I2:>

126

127

i

Table 9. Phonetic Characters printed by a Dotnnatrix Printer

tttt

I by Head, Ubrcyy 0 Woimotton Services. OARC, Bomboy 400 065. Indio.