Python delete character in a string

Question

I'm stuck. I've tried several approaches and I can't get the correct output.

string = "Hello! I have a Big!!! problem 666 is not a good number__$"
ns =''.join([i for i in string if i.isalpha()])
print(ns)

HelloIhaveaBigproblemisnotagoodnumber

I want this output:

Hello I have a Big problem is not a good number

Can you help me? Thank you!!

But there, you only described what you don't want. You didn't say what you want. I mean, which criterion leads to "Hello I have a Big problem is not a good number"? Do you want to remove digits and punctuation? Or to remove everything but letters and spaces?
– chrslg, Commented Nov 11, 2022 at 18:11
Thanks for asking. I want to remove the non-alphabetic characters, but keep the string as a sentence with the words separated by spaces.
– JRT, Commented Nov 11, 2022 at 18:13

tdelaney · Accepted Answer · 2022-11-11 18:42:48Z

2

You could extend the condition applied to each character so that spaces are kept as well, e.g.,

ns =''.join([i for i in string if i == " " or i.isalpha()])

But there is a problem with sequences like "666 " that leave an extra space in the text.
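A quick way to see that problem is to print the result with repr(), which makes the doubled space visible. This is only a small check on the question's sample string, not part of the fix:

```python
string = "Hello! I have a Big!!! problem 666 is not a good number__$"

# Keep letters and spaces only; digits and punctuation are dropped.
ns = "".join(c for c in string if c == " " or c.isalpha())

# repr() shows the two spaces left where "666" used to sit between them.
print(repr(ns))
# 'Hello I have a Big problem  is not a good number'
```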

Instead, you could use a regex to break the string down into a list of words and the non-word text that follows each one. Filter out the characters you don't want, and then drop any item whose word ended up empty.

import re

string = "Hello! I have a Big!!! problem 666 is not a good number__$"
tmp = []

for word, other in re.findall(r"(\w+)([^\w]*)", string):
    # strip non-alpha
    word = "".join(c for c in word if c.isalpha())
    # preserve only spaces
    other = "".join(c for c in other if c == " ")
    # only add if word still exists
    if word:
        tmp.append(word + other)
ns = "".join(tmp)
print(ns)

Output

Hello I have a Big problem is not a good number


1 Comment

Bhargav Over a year ago

Pure pythonic!! +1 from my side

Bhargav · Accepted Answer · 2022-11-11 18:19:41Z

2

Match the unwanted characters with a regex and replace each run of them with a single space using re.sub:

import re
string = "Hello! I have a Big!!! problem 666 is not a good number__$"
# strip() drops the trailing space that replaces the final "__$"
print(re.sub(r'[^a-zA-Z]+', ' ', string).strip())

Output

Hello I have a Big problem is not a good number


1 Comment

tdelaney Over a year ago

With a warning: It doesn't work for non-English alphabets.
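To illustrate the warning, here is one possible Unicode-aware variant. It is a sketch, not the answerer's code: the helper name keep_letters and the Spanish sample string are made up for illustration. It relies on \W and \d being Unicode-aware for str patterns in Python 3, so accented letters survive; a few unusual numeric characters that count as word characters but not as decimal digits would survive too.

```python
import re

def keep_letters(text):
    # [\W\d_]+ matches runs of non-word characters, digits and underscores,
    # so what remains is (roughly) letters in any alphabet.
    return re.sub(r"[\W\d_]+", " ", text).strip()

print(keep_letters("Hello! I have a Big!!! problem 666 is not a good number__$"))
# Hello I have a Big problem is not a good number

print(keep_letters("¡Café número 42, señor!"))
# Café número señor
```

With '[^a-zA-Z]+' the second string would lose its accented letters, because é, ú and ñ fall outside the ASCII ranges.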

segev_gr · Accepted Answer · 2022-11-11 18:20:52Z

0

You can use:

import re

string = "Hello! I have a Big!!! problem 666 is not a good number__$"
regex = re.compile('[^a-zA-Z ]')
print(regex.sub('', string))

And it will print:

Hello I have a Big problem  is not a good number

(With double space between "problem" and "is")

If you want to remove the double space, you can write it like this:

import re

string = "Hello! I have a Big!!! problem 666 is not a good number__$"
regex = re.compile('[^a-zA-Z ]')
string = regex.sub('', string)
print(string.replace('  ', ' '))

And now the output will be:

Hello I have a Big problem is not a good number
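One caveat worth checking: a single replace('  ', ' ') pass is enough for this input, but it may leave a double space behind when a run has three or more spaces. The sketch below uses a made-up string to show that, along with two common ways to collapse runs of any length:

```python
import re

text = "a   b  c"  # three spaces, then two

# One replace pass turns each pair into a single space, so the run of three
# becomes a run of two.
once = text.replace("  ", " ")
print(repr(once))  # 'a  b c'

# Collapse every run of two or more spaces in a single pass.
collapsed = re.sub(r" {2,}", " ", text)
print(repr(collapsed))  # 'a b c'

# split() with no arguments splits on any whitespace and ignores the ends.
rejoined = " ".join(text.split())
print(repr(rejoined))  # 'a b c'
```

Note that the split/join form also removes leading and trailing whitespace and treats tabs and newlines as separators, which may or may not be what you want.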


C-3PO · Accepted Answer · 2022-11-11 18:25:15Z

0

A lot of simple problems can be solved without the re library.

For this case, you can keep only the characters that are letters of the alphabet or spaces, and filter out everything else:

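from string import ascii_lowercase

s = "Hello! I have a Big!!! problem 666 is not a good number__$"
# keep only ASCII letters and spaces
ns = ''.join(filter(lambda c: c.lower() in ascii_lowercase + ' ', s))
# collapse repeated spaces into one
while '  ' in ns: ns = ns.replace('  ', ' ')
print(ns)

# output:
# Hello I have a Big problem is not a good number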

Repeated spaces are filtered in the one-liner while-loop found above.

To work with non-English characters, you can replace ascii_lowercase with whatever set of characters you need.
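As a rough sketch of that idea, the allowed set below adds a few German characters to the ASCII letters. The ALLOWED constant, the keep_allowed helper and the sample sentence are assumptions made for illustration; pick whatever characters your text actually needs.

```python
from string import ascii_lowercase

# Hypothetical alphabet: ASCII letters, a few German characters, and the space.
ALLOWED = set(ascii_lowercase + "äöüß ")

def keep_allowed(text, allowed=ALLOWED):
    # Keep a character when its lowercase form is in the allowed set.
    kept = "".join(c for c in text if c.lower() in allowed)
    # Collapse repeated spaces, then trim the ends.
    while "  " in kept:
        kept = kept.replace("  ", " ")
    return kept.strip()

print(keep_allowed("Grüße aus Köln 2022!"))
# Grüße aus Köln
```

If you simply want every letter from any alphabet, testing c.isalpha() instead of membership in a fixed set is usually less work.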

edited Nov 11, 2022 at 18:25

answered Nov 11, 2022 at 18:16


3 Comments

chrslg Over a year ago

But then you have 2 spaces between problem and is

C-3PO Over a year ago

The updated version also solves this problem.

JRT Over a year ago

Thanks to all, I can continue with the learning of python, greetings.
